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COMPOSITIONS, SPLICE VARIANTS AND METHODS 
RELATING TO COLON SPECIFIC GENES AND PROTEINS 



5 INTRODUCTION 

This application claims the benefit of priority from U.S. Provisional Patent 
Application Serial No. 60/431,132 filed December 4, 2002 and 60/431,144 filed 
December 4, 2002, which are herein incorporated by reference in their entireties. 

FIELD OF THE IN\^NTION 

10 The present invention relates to newly identified nucleic acids and polypeptides 

present in normal and neoplastic colon cells, including fragments, variants and derivatives 
of the nucleic acids and polypeptides. The present invention also relates to antibodies to 
the poljpeptides of the invention, as well as agonists and antagonists of the polypeptides 
of the invention. The invention also relates to compositions comprising the nucleic acids, 

15 polj^peptides, antibodies, post translational modifications (PTMs), variants, derivatives, 
agonists and antagonists thereof and methods for the use of these compositions. These 
uses include identifying, diagnosing, monitoring, staging, imaging and treating colon 
cancer and/or non-cancerous disease states in colon, identifying colon tissue and 
monitoring and identifying and/or designing agonists and antagonists of polj^Deptides of 

20 the invention. The uses also include gene therapy, therapeutic molecules including but not 
limited to antibodies or antisense molecules, production of transgenic animals and cells, 
and production of engineered colon tissue for treatment and research. 

BACKGROUND OF THE INVENTION 
Colorectal cancer is the second most common cause of cancer death in the United 

25 States and the third most prevalent cancer in both men and women. M. L. Davila & A. D. 
Davila, Screening for Colon and Rectal Cancer, in Colon and Rectal Cancer 47 (Peter S. 
Edelstein ed.. 2000). The American Cancer Society estimates that there will be about 
105,500 new cases of colon cancer and 42,000 new cases of rectal cancer in 2003 in the 
United States. Colon cancer and rectal cancer will cause about 57,100 deaths combined. 

30 ACS Website: cancer.org on the worid wide web. Neariy all cases of colorectal cancer 
arise from adenomatous pol>T)S, some of which mature into large polyps, undergo 
abnormal growth and development, and ultimately progress into cancer. Davila at 55-56. 
This progression would appear to take at least 10 years in most patients, rendering it a 
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readily treatable form of cancer if diagnosed early, when the cancer is localized. Davila at 
56; Walter J. Burdette, Cancer: Etiolog^^ Diagnosis, and Treatment 125 (1998). 

Although our understanding of the etiology of colon cancer is undergoing 
continual refinement, extensive research in this area points to a combination of factors, 
5 including age, hereditary and nonhereditary conditions, and environmental/dietary factors. 
Age is a key risk factor in the development of colorectal cancer, Davila at 48, with men 
and women over 40 years of age becoming increasingly susceptible to that cancer. 
Burdette at 1 26. Incidence rates increase considerably in each subsequent decade of life. 
Davila at 48. A number of hereditary and nonhereditary conditions have also been linked 

10 to a heightened risk of developing colorectal cancer, including familial adenomatous 

polyposis (FAP), hereditar>' nonpohposis colorectal cancer (Lynch syndrome or HNPCC), 
a personal and/or family history of colorectal cancer or adenomatous polyps, inflammatory 
bowel disease, diabetes mellitus, and obesitj'. Davila at 47: Henry T. Lynch & Jane F. 
Lynch, Hereditajy Nonpolyposis Colorectal Cancer (Lynch Syndromes), in Colon and 

1 5 Rectal Cancer 67-68 (Peter S. Edelstein ed., 2000). 

Environmental/dietary factors associated with an increased risk of colorectal 
cancer include a high fat diet, intake of high dietary red meat, and sedentary lifestjie. 
Davila at 47; Reddy, B. S., Prev. Med, 16(4): 460-7 (1987). Conversely, 
environmental/dietary factors associated with a reduced risk of colorectal cancer include a 

20 diet high in fiber, folic acid, calcium, and hormone-replacement therapy in post- 
menopausal women. Davila at 50-55. The effect of antioxidants in reducing the risk of 
colon cancer is unclear. Davila at 53. 

Because colon cancer is highly treatable when detected at an early, localized stage, 
screening should be a part of routine care for all adults starting at age 50, especially those 

25 with first-degree relatives with colorectal cancer. One major advantage of colorectal 
cancer screening over its counterparts in other tj^pes of cancer is its ability to not only 
detect precancerous lesions, but to remove them as well. Davila at 56. The key colorectal 
cancer screening tests in use today are fecal occult blood test, sigmoidoscopy, 
colonoscopy, double-contrast barium enema, and the carcinoembryonic antigen (CEA) 

30 test. Burdette at 125; Davila at 56. 

The fecal occult blood test (FOBT) screens for colorectal cancer by detecting the 
amount of blood in the stool, the premise being that neoplastic tissue, particularly 
malignant tissue, bleeds more than typical mucosa, with the amount of bleeding increasing 
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with pohnp size and cancer stage. Davila at 56-57. effective at detecting early stage 

tumors, FOBT is unable to detect adenomatous poI>Tps (premalignant lesions), and, 
depending on the contents of the fecal sample, is subject to rendering false positives. 
Davila at 56-59. Sigmoidoscopy and colonoscopy, by contrast, allow direct visualization 
5 of the bowel, and enable one to detect, biopsy, and remove adenomatous pol\T)s. Davila at 
59-60, 61, Despite the advantages of these procedures, there are accompanying 
downsides: sigmoidoscopy, by defmition, is limited to the sigmoid colon and below, 
colonoscopy is a relatively expensive procedure, and both share the risk of possible bowel 
perforation and hemorrhaging. Davila at 59-60. Double-contrast barium enema (DCBE) 
10 enables detection of lesions better than FOBT, and almost as well a colonoscopy, but it 
may be limited in evaluating the winding rectosigmoid region. Davila at 60, The CEA 
blood test, which involves screening the blood for carcinoembryonic antigen, shares the 
downside of FOBT, in that it is of limited utilitj' in detecting colorectal cancer at an early 
stage. Burdette at 125. 

15 Once colon cancer has been diagnosed, treatment decisions are typically made in 

reference to the stage of cancer progression. A number of techniques are employed to 
stage the cancer (some of which are also used to screen for colon cancer), including 
pathologic examination of resected colon, sigmoidoscopy, colonoscopy, and various 
imaging techniques, AJCC Cancer Staging Handbook 84 (Ir\'in D. Fleming et al, eds., 5'^ 

20 ed. 199S); Montgomery, R. C. and Ridge, J.A., SemiiK Surg. Oncol. 15(3): 143-150 
(1998), Moreover, chest films, liver functionalitj' tests, and liver scans are employed to 
determine the extent of metastasis. Fleming at 84. While computerized tomography and 
magnetic resonance imaging are useful in staging colorectal cancer in its later stages, both 
have unacceptably low staging accuracy for identifying early stages of the disease, due to 

25 the difficulty that both methods have in (1) revealing the depth of bowel wall tumor 

infiltration and (2) diagnosing malignant adenopathy, Thoeni, R. F., Radiol Clin, K Am. 
35(2): 457-85 (1997). Rather, techniques such as transrectal ultrasound (TRUS) are 
preferred in this context, although this technique is inaccurate with respect to detecting 
small lymph nodes that may contain metastases. David Blumberg & Frank G. Opelka, 

30 Neoadjuvant and Adjuvant Tljerapyfor Adenocarcinoma of the Rectum, in Colon and 
Rectal Cancer 316 (Peter S. Edelstein ed., 2000). 

Several classification systems have been devised to stage the extent of colorectal 
cancer, including the Dukes' system and the more detailed International Union against 
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Cancer- American Joint Committee on Cancer TNM staging system, which is considered 
by many in the field to be a more useful staging system. Burdette at 126-27. The TNM 
system, which is used for either clinical or pathological staging, is divided into four stages, 
each of which evaluates the extent of cancer gro\\1h with respect to primaiy tumor (T), 
5 regional lymph nodes (N), and distant metastasis (M). Fleming at S4-85. The system 
focuses on the extent of tumor invasion into the intestinal wall, invasion of adjacent 
structures, the number of regional lymph nodes that have been affected, and whether 
distant metastasis has occurred. Fleming at 81 . 

Stage 0 is characterized by in situ carcinoma (Tis), in which the cancer cells are 
10 located inside the glandular basement membrane (intraepithelial) or lamina propria 

(intramucosal). In this stage, the cancer has not spread to the regional lymph nodes (NO), 
and there is no distant metastasis (MO). In stage I, there is still no spread of the cancer to 
the regional lymph nodes and no distant metastasis, but the tumor has invaded the 
submucosa (Tl) or has progressed further to invade the muscularis propria (T2). Stage II 
15 also involves no spread of the cancer to the regional lymph nodes and no distant 

metastasis, but the tumor has invaded the subserosa, or the nonperitonealized pericolic or 
perirectal tissues (T3), or has progressed to invade other organs or structures, and/or has 
perforated the visceral peritoneum (T4). Stage III is characterized by any of the T 
substages, no distant metastasis, and either metastasis in 1 to 3 regional lymph nodes (Nl) 
20 or metastasis in four or more regional lymph nodes (N2). Lastly, stage IV involves any of 
the T or N substages, as well as distant metastasis. Fleming at 84-85; Burdette at 127. 

Currently, pathological staging of colon cancer is preferable over clinical staging 
as pathological staging provides a more accurate prognosis. Pathological staging typically 
involves examination of the resected colon section, along with surgical examination of the 
5 abdominal cavity. Fleming at 84. Clinical staging would be a preferred method of staging 
were it at least as accurate as pathological staging, as it does not depend on the invasive 
procedures of its counterpart. 

Turning to the treatment of colorectal cancer, surgical resection results in a cure for 
roughly 50% of patients. Irradiation is used both preoperative^ and postoperatively in 
0 treating colorectal cancer. Cheraotherapeutic agents, particulariy 5-fluorouracil, are also 
powerful weapons in treating colorectal cancer. Other agents include irinotecan and 
floxuridine, cisplatin, levamisole, methotrexate, intefferon-a, and leucovorin. Burdette at 
125, 132-33. Nonetheless, thirty to forty percent of patients will develop a recurrence of 
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colon cancer following surgical resection, which in many patients is the ultimate cause of 
death. Wayne De Vos, Follow-up After Treatment of Colon Cancer, Colon and Rectal 
Cancer 225 (Peter S. Edelstein ed., 2000). Accordingly, colon cancer patients must be 
closely monitored to determine response to therapy and to detect persistent or recurrent 
5 disease and metastasis. 

The next few paragraphs describe the some of molecular bases of colon cancer. In 
the case of FAP, the tumor suppressor gene AFC (adenomatous polyposis coli), 
chromosomally located at 5q21, has been either inactivated or deleted by mutation. 
Alberts et al., Molecular Biology of the Cell 1288 (3d ed. 1994). The AFC protein plays a 

10 role in a number of functions, including cell adhesion, apoptosis, and repression of the c- 
myc oncogene. N. R. Hall & R. D. Madoff, Genetics and the Polyp-Cancer Sequence, 
Colon and Rectal Cancer 8 (Peter S. Edelstein, ed,, 2000). Of those patients with 
colorectal cancer who have nomial APC genes, over 65% have such mutations in the 
cancer cells but not in other tissues. Alberts et al., supra at 1288. In the case of HPNCC, 

1 5 patients manifest abnormalities in the tumor suppressor gene HNPCC, but only about 1 5% 
of tumors contain the mutated gene. Id. A host of other genes have also been implicated 
in colorectal cancer, including the Y^-ras, Yi-ras, H-ras and c-myc oncogenes, and the 
tumor suppressor genes DCC (deleted in colon carcinoma) and p53. Hall & Madoff, at 8- 
9; Alberts et al., at 12S8. 

20 Abnormalities in Wg/Wnt signal transduction pathway are also associated with the 

development of colorectal carcinoma. Taipale, J, and Beachy, P. A. Nature 411: 349-354 
(2001). Wntl is a secreted protein gene originally identified within mouse mammar>' 
cancers by its insertion into the mouse mammary tumor virus (MMTV) gene. The protein 
is homologous to the wingless (Wg) gene product of Drosophila, in which it functions as 

25 an important factor for the determination of dorsal-ventral segmentation and regulates the 
formation of fly imaginal discs. Wg/Wnt pathway controls cell proliferation, death and 
differentiation. Taipal (2001). There are at least 13 members in the Wnt family. These 
proteins have been found expressed mainly in the central nen'ous system (CNS) of 
vertebrates as well as other tissues such as mammar>' and intestine. The Wnt proteins are 

30 the ligands for a family of seven transmembrane domain receptors related to the Frizzled 
gene product in Drosophila. Binding Wnt to Frizzled stimulates the activity of the 
downstream target. Dishevelled, which in turn inactivates the glycogen synthesase kinase • 
3P (GSK3P). Taipal (2001). Usually active GSK3P will fomi a complex with the 
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adenomatous pol>'posis coli (APC) protein and phosphor>'late another complex member, 
P-catenin. Once phosphorj'lated, p-catenin is directed to degradation through the ubiquitin 
pathway. When GSK3P or APC activity is down regulated, P-catenin is accumulated in 
the c>aoplasm and binds to the T-cell factor or lymphocyte excitation factor (Tcf/Lef) 
5 family of transcriptional factors. Binding of P-catenin to Tcf releases the transcriptional 
repression and induces gene transcription. Among the genes regulated by P-catenin are a 
transcriptional repressor Engrailed, a transforming growth factor-p (TGF-P) family 
member Decapentaplegic, and the cytokine Hedgehog in Drosophila. P-Catenin is also 
involved in regulating cell adhesion by binding to a-catenin and E-cadherin. On the other 
10 hand, binding of p-catenin to these proteins controls the c>toplasmic p-catenin level and 
its complexing with TCF. Taipal (2001). Growth factor stimulation and activation of c- 
src or v-src also regulate P-catenin level by phosphorylation of a-catenin and its related 
protein, pl20*'". When phosphorj'lated, these proteins decrease their binding to E- 
cadherin and P-catenin resulting in the accumulation of cyloplasmic P-catenin. Reynolds, 
15 A.B. et al. Mol Cell Biol 14: 8333-S342 (1994). In colon cancer, c-src enzymatic activity 
has been shown to be increased to the level of v-src. Alternation of components in the 
Wg/U^nt pathway promotes colorectal carcinoma development. The best known 
modifications are to the APC gene. Nicola S et al. Hum, Mol Genet 10:721-733 (2001). 
This gemiline mutation causes the appearance of hundreds to thousands of adenomatous 
20 pol>ps in the large bowel. It is the gene defect that accounts for the autosomally 

dominantly inherited FAP and related syndromes. The molecular alternations that occur 
in this pathway largely involve deletions of alleles of tumor-suppressor genes, such as 
APC, p53 and Deleted in Colorectal Cancer (DCC), combined with mutational activation 
of proto-oncogenes, especially c-Ki-ras. Aoki, T. et al. Human Mutat, 3: 342-346 (1994). 
25 All of these lead to genomic instability in colorectal cancers. 

Another source of genomic instability in colorectal cancer is the defect of DNA 
mismatch repair (MMR) genes. Human homologues of the bacterial mutWLS complex 
(hMSH2, hMLHl, hPMSl, hPMS2 and hMSH6), which is involved in the DNA mismatch 
repair in bacteria, have been shown to cause the HNPCC (about 70-90% HNPCC) when 
30 mutated. Modrich, P. and Lahue, R, Ann Rev, Biochem, 65: 101-133 (1996); and 

Peltomaki, P. Hum. Mol Genet 10: 735-740 (2001). The inactivation of these proteins 
leads to the accumulation of mutations and causes genetic instability that represents errors 
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in the accurate replication of the repetitive mono-, di-, tri- and tetra-nucleotide repeats, 
which are scattered throughout the genome (microsatellite regions). Jass, J,R. et al. / 
Gastroenterol Hepatol 17: 17-26 (2002). Like in the classic FAP, mutational activation of 
c-Ki-ras is also required for the promotion of MSI in the alternative HNPCC. Mutations 
5 in other proteins such as the tumor suppressor protein phosphatase PTEN (Zhou, X.P. et 
al. Hum, Mol. Genet 11: 445-450 (2002)), BAX (Buttler, L.M. Aus. N, Z. J. Surg. 69: 88- 
94 (1999)), Caspase-5 (Planck, M. Cancer Genet Cytogeuet. 134: 46-54 (2002)), TGF3- 
RH (Fallik, D. et al. Gastroenterol Clin Biol. 24: 917-22 (2000)) and IGFII-R 
(Giovannucci E. J. Nutr, 131 : 3109S-20S (2001)) have also been found in some colorectal 

1 0 tumors possibly as the cause of MMR defect. 

Some tyrosine kinases have been shown up-regulated in colorectal tumor tissues or 
cell lines like HT29. Skoudy, A. et al. / 317 (Pt 1): 279-84 (1996). Focal 

adhesion kinase (FAK) and its up-stream kinase c-src and c-yes in colonic epithelial cells 
may play an important role in the promotion of colorectal cancers through the extracellular 

15 matrix (ECM) and integrin-mediated signaling pathwaj's. Jessup, J.M. et al., Tlie 

molecular biology of colorectal carcinomay in: The Molecular Basis of Human Cancer, 
251-268 (Coleman W.B. and Tsongalis G.J. Eds. 2002). The formation of c-src/FAK 
complexes may coordinately deregulate VEGF expression and apoptosis inhibition. 
Recent evidences suggest that a specific signal-transduction pathway for cell survival that 

20 implicates integrin engagement leads to FAK activation and thus activates PI-3 kinase and 
akt. In turn, akt phosphorylates BAD and blocks apoptosis in epithelial cells. The 
activation of c-src in colon cancer may induce VEGF expression through the hypoxia 
pathway. Other genes that may be implicated in colorectal cancer include Cox enzymes 
(Ota, S. et al. Aliment Pharmacol Ther. 16 (Suppl 2): 102-106 (2002)), estrogen (al- 

25 Azzawi, F. and Wahab, M. Climacteric 5: 3-14 (2002)), peroxisome proliferator-activated 
receptor-y (PPAR-y) (Gelman, L. et al. Cell Mol Life Sci 55: 932-943 (1999)), IGF-I 
(Giovannucci (2001)), thymine DNA glycosylase (TDG) (Hardeland, U. et al. Prog, 
Nucleic Acid Res. Mol Biol 68: 235-253 (2001)) and EGF (Mendelsohn, J. Endocrine- 
Related Cancer 8: 3-9 (2001)). 

30 Gene deletion and mutation are not the only causes for development of colorectal 

cancers. Epigenetic silencing by DNA methylation also accounts for the loss of function 
of colorectal cancer suppressor genes. A strong association between MSI and CpG island 
methylation has been well characterized in sporadic colorectal cancers with high MSI but 
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not in those of hereditary origin. In one experiment, DNA methylation of MLHl, 
CDKN2A, MGMT, THBSl, FL^, APC, and pMARF genes has been sho\\Ti in S0%, 
55%, 23%, 23%, 58%, 35%, and 50% of 40 sporadic colorectal cancers with high MSI 
respectively. Yamamoto, H. et al. Genes Chromosomes Cancer 33: 322-325 (2002); and 
5 Kim, K.M. et al. Oncogene, 12;21(35): 5441-9 (2002). Carcinogen metabolism enzymes 
such as GST, NAT, CYP and MTHFR are also associated with an increased or decreased 
colorectal cancer risk. Pistorius, S. et al. Kongressbd Dtsch Ges ChirKongr 118: 820-824 
(2001); and Potter, J.D. 1 Natl Cancer InsL 91: 916-932 (1999). 

From the foregoing, it is clear that procedures used for detecting, diagnosing, 

10 monitoring, staging, prognosticating, and preventing the recurrence of colorectal cancer 
are of critical importance to the outcome of the patient. Moreover, current procedures, 
while helpful in each of these analyses, are limited by their specificity, sensitivit)', 
invasiveness, and/or their cost. As such, highly specific and sensitive procedures that 
would operate by way of detecting novel markers in cells, tissues, or bodily fluids, with 

15 minimal invasiveness and at a reasonable cost, would be highly desirable. 

Accordingly, there is a great need for more sensitive and accurate methods for 
predicting whether a person is likely to develop colorectal cancer, for diagnosing 
colorectal cancer, for monitoring the progression of the disease, for staging the colorectal 
cancer, for deteimining whether the colorectal cancer has metastasized, and for imaging 

20 the colorectal cancer. Following accurate diagnosis, there is also a need for less invasive 
and more effective treatment of colorectal cancen 

Growth and metastasis of solid tumors are also dependent on angiogenesis. 
Folkman, J., 1986, Cancer Research, 46, 467-473; Folkman, J., 1989, Journal of the 
National Cancer Institute^ 82, 4-6. It has been shown, for example, that tumors which 

25 enlarge to greater than 2 mm must obtain their own blood supply and do so by inducing 
the growth of new capillary blood vessels. Once these new blood vessels become 
embedded in the tumor, they provide a means for tumor cells to enter the circulation and 
metastasize to distant sites such as liver, lung or bone. Weidner, N., et a/., 1991, The New 
England Journal of Medicine, 324(1), 1-8. 

30 Angiogenesis, defined as the growth or sprouting of new blood vessels from 

existing vessels, is a complex process that primarily occurs during embryonic 
development. The process is distinct from vasculogenesis, in that the new endothelial cells 
lining the vessel arise from proliferation of existing cells, rather than differentiating from 
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Stem cells. The process is invasive and dependent upon proteolysis of the extracellular 
matrix (ECM), migration of new endothelial cells, and synthesis of new matrix 
components. Angiogenesis occurs during embryogenic development of the circulator 
system; however, in adult humans, angiogenesis only occurs as a response to a 
5 pathological condition (except during the reproductive cycle in women). 

Under normal physiological conditions in adults, angiogenesis takes place only in 
very restricted situations such as hair growlh and wounding healing. Auerbach, W. and 
Auerbach, R., 1994, Pharmacol ner. 63(3):265-3 11; Ribatti et al.J991, Haematologica 
76(4):3 1 1-20; Risau, 1997, Nature 386(6626):67 1-4. Angiogenesis progresses by a 

10 stimulus which results in the formation of a migrating column of endothelial cells. 

Proteolytic activity is focused at the advancing tip of this "vascular sprout", which breaks 
down the ECM sufficiently to permit the column of cells to infiltrate and migrate. Behind 
the advancing front, the endothelial cells differentiate and begin to adhere to each other, 
thus forming a new basement membrane. The cells then cease proliferation and finally 

1 5 define a lumen for the new arteriole or capillary. 

Unregulated angiogenesis has gradually been recognized to be responsible for a 
wide range of disorders, including, but not limited to, cancer, cardiovascular disease, 
rheumatoid arthritis, psoriasis and diabetic retinopathy. Folkman, 1995, Nat Med 1(1):27- 
31; Isner, 1999, Circulation 99(13): 1653-5; Koch, \99%, Arthritis Rheum 41(6):951-62; 

20 Walsh, 1999, Rheumatology (Oxford) 38(2):103-12; Ware and Simons, 1997, Nat Med 
3(2): 158-64. 

Of particular interest is the observation that angiogenesis is required by solid 
tumors for their growth and metastases. Folkman, 1986 supra\ Folkman 1990, J Natl 
Cancerlnst., 82(1) 4-6; Folkman, l992,Semin Ca//cer 3(2):65-71; Zetter, 199S,.47/;2w 

25 ^^v Med 49:407-24. A tumor usually begins as a single aberrant cell which can proliferate 
only to a size of a few cubic millimeters due to the distance from available capillary beds, 
and it can stay 'dormant' without further growth and dissemination for a long period of 
time. Some tumor cells then switch to the angiogenic phenotj'pe to activate endothelial 
cells, which proliferate and mature into new capillary blood vessels. These newly formed 

30 blood vessels not only allow for continued growth of the primaiy tumor, but also for the 
dissemination and recolonization of metastatic tumor cells. The precise mechanisms that 
control the angiogenic switch is not well understood, but it is believed that 
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neovascularization of tumor mass results from the net balance of a multitude of 
angiogenesis stimulators and inhibitors Folkman, 1995, supra. 

One of the most potent angiogenesis inhibitors is endostatin identified by O'Reilly 
and Folkman. O'Reilly et al., 1997, Cell SS(2):277-85; O'Reilly et al, 1994, Cell 79(2):3 
5 15-28. Its discover}^ was based on the phenomenon that certain primary tumors can inhibit 
the growth of distant metastases. O'Reilly and Folkman h>T)othesized that a primary tumor 
initiates angiogenesis by generating angiogenic stimulators in excess of inhibitors. 
However, angiogenic inhibitors, by virtue of their longer half life in the circulation, reach 
the site of a secondarj' tumor in excess of the stimulators. The net result is the growth of 

10 primary tumor and inhibition of secondarj' tumor. Endostatin is one of a growing list of 
such angiogenesis inhibitors produced by primary tumors. It is a proteolytic fragment of a 
larger protein: endostatin is a 20 kDa fragment of collagen XVIII (amino acid HI 132- 
K1315 in murine collagen XVIII). Endostatin has been shown to specifically inhibit 
endothelial cell proliferation in vitro and block angiogenesis in vivo. More importantly, 

1 5 administration of endostatin to tumor-bearing mice leads to significant tumor regression, 
and no toxicity or drug resistance has been observed even after multiple treatment cycles. 
Boehm et al., 1997, Nature 390(6658):404-407. The fact that endostatin targets genetically 
stable endothelial cells and inhibits a variet>' of solid tumors makes it a very attractive 
candidate for anticancer therapy. Fidler and Ellis, 1994, Cell 79(2): 185-8; Gastl et al., 

20 1997, Oncology 54(3): 177-84; Hinsbergh et al., 1999, Ann Oncol 10 Suppl 4:60-3. In 
addition, angiogenesis inhibitors have been shown to be more effective when combined 
with radiation and cheniotherapeutic agents. Klement, 2000, J. Clin Invest, 105(8) R15- 
24. Browder, 2000, Cancer Res. 6-(7) 1878-86, Arap et al, 1998, Science 279(5349):377- 
80; Mauceri et al., 1998, Nature 394(6690):287-9L 

25 SUMMARY OF THE INVENTION 

The present invention solves many needs in the art by providing nucleic acid 
molecules, polj'peptides and antibodies thereto, variants and derivatives of the nucleic 
acids and polj'peptides, and agonists and antagonists thereto that may be used to identify, 
diagnose, monitor, stage, image and treat colon cancer and/or non-cancerous disease states 
30 in colon; identify and monitor colon tissue; and identify and design agonists and 
, antagonists of polj'peptides of tlie invention. The invention also provides gene therapy, 
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methods for producing transgenic animals and cells, and methods for producing 
engineered colon tissue for treatment and research. 

One aspect of the present invention relates to nucleic acid molecules that are 
specific to colon cells, colon tissue and/or the colon organ. These colon specific nucleic 
5 acids (CSNAs) may be a naturally occurring cDNA, genomic DNA, RNA, or a fragment 
of one of these nucleic acids, or may be a non-naturally occurring nucleic acid molecule. 
If the CSNA is genomic DNA, then the CSNA is a colon specific gene (CSG). If the 
CSNA is RNA, then it is a colon specific transcript encoded by a CSG. Due to alternative 
splicing and transcriptional modification one CSG may encode for multiple colon specific 

10 RNAs. In a preferred embodiment, the nucleic acid molecule encodes a polypeptide that 
is specific to colon. More preferred is a nucleic acid molecule that encodes a polypeptide 
comprising an amino acid sequence of SEQ ID NO: 96-237. In another preferred 
embodiment, the nucleic acid molecule comprises a nucleic acid sequence of SEQ ID NO: 
1-95. For the CSNA sequences listed herein, DEX0448_001.nt.l corresponds to SEQ ID 

15 NO: 1. For sequences with multiple splice variants, the parent sequence 

DEX0448_001.nt.l, will be followed by DEX044S_001.nt.2, etc. for each splice variant. 
The sequences off the corresponding peptides are listed as DEX044S_001.aa.l, etc. For 
the mapping of all of the nucleotides and peptides, see the table in the Example 1 section 
below. 

20 This aspect of the present invention also relates to nucleic acid molecules that 

selectively hybridize or exhibit substantial sequence similarity to nucleic acid molecules 
encoding a Colon Specific Protein (CSP), or that selectively hybridize or exhibit 
substantial sequence similarity to a CSNA. In one embodiment of the present invention 
the nucleic acid molecule comprises an allelic variant of a nucleic acid molecule encoding 

25 a CSP, or an allelic variant of a CSNA. In another embodiment, the nucleic acid molecule 
comprises a part of a nucleic acid sequence that encodes a CSP or a part of a nucleic acid 
sequence of a CSNA. 

In addition, this aspect of the present invention relates to a nucleic acid molecule 
further comprising one or more expression control sequences controlling the transcription 

30 and/or translation of all or a part of a CSNA or the transcription and/or translation of a 
nucleic acid molecule that encodes all or a fragment of a CSP. 

" Another aspect of the present invention relates to* vectors and/or host cells 
comprising a nucleic acid molecule of this invention. In a preferred embodiment, the 
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nucleic acid molecule of the vector and^or host cell encodes all or a fragment of a CSP, In 
another preferred embodiment, the nucleic acid molecule of the vector and/or host cell 
comprises all or a part of a CSNA. Vectors and host cells of the present invention are 
usefiil in the recombinant production of pol>'peptides, particularly CSPs of the present 
5 invention. 

Another aspect of the present invention relates to pol>'peptides encoded by a 
nucleic acid molecule of this invention. The polypeptide may comprise either a fragment 
or a full-length protein. In a preferred embodiment, the pol>TDeptide is a CSP. However, 
this aspect of the present invention also relates to mutant proteins (muteins) of CSPs, 

10 fusion proteins of which a portion is a CSP, and proteins and polypeptides encoded by 
allelic variants of a CSNA as provided herein. 

A further aspect of the present invention is a novel splice variant which encodes an 
amino acid sequence that provides a novel region to be targeted for the generation of 
reagents that can be used in the detection and/or treatment of cancer. The novel amino 

15 acid sequence may lead to a unique protein structure, protein subcellular localization, 

biochemical processing or fiinction. This information can be used to directly or indirectly 
facilitate the generation of additional or novel therapeutics or diagnostics. The nucleotide 
sequence in this novel splice variant can be used as a nucleic acid probe for the diagnosis 
and/or treatment of cancer, 

20 Another aspect of the present invention relates to antibodies and other binders that 

specifically bind to a polypeptide of the instant invention. Accordingly antibodies or 
binders of the present invention specifically bind to CSPs, muteins, fusion proteins, and/or 
homologous proteins or polypeptides encoded by allelic variants of a CSNA as provided 
herein. 

25 Another aspect of the present invention relates to agonists and antagonists of the 

nucleic acid molecules and polypeptides of this invention. The agonists and antagonists of 
the instant invention may be used to treat colon cancer and non-cancerous disease states in 
colon and to produce engineered colon tissue. 

Another aspect of the present invention relates to methods for using the nucleic 

30 acid molecules to detect or amplifj' nucleic acid molecules that have similar or identical 
nucleic acid sequences compared to the nucleic acid molecules described herein. Such 
methods are useful in identifying, diagnosing, monitoring, staging, imaging and treating 
colon cancer and/or non-cancerous disease states in colon. Such methods are also useful 
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in identifying and/or monitoring colon tissue. In addition, measurement of levels of one or 
more of the nucleic acid molecules of this invention may be useful as a diagnostic as part 
of a panel in combination with known other markers, particularly those described in the 
colon cancer background section above. 

5 Another aspect of the present invention relates to use of the nucleic acid molecules 

of this invention in gene therapy, for producing transgenic animals and cells, and for 
producing engineered colon tissue for treatment and research. 

Another aspect of the present invention relates to methods for detecting 
polypeptides of this invention, preferably using antibodies thereto. Such methods are 

10 useful to identify, diagnose, monitor, stage, image and treat colon cancer and non- 
cancerous disease states in colon. In addition, measurement of levels of one or more of 
the polypeptides of this invention may be useful to identify, diagnose, monitor, stage, 
and/or image colon cancer in combination with known other markers, particulariy those 
described in the colon cancer background section above. The polypeptides of the present 

1 5 invention can also be used to identify^ and/or monitor colon tissue, and to produce 
engineered colon tissue. 

Yet another aspect of the present invention relates to a computer readable means of 
storing the nucleic acid and amino acid sequences of the invention. The records of the 
computer readable means can be accessed for reading and displaying of sequences for 

20 comparison, alignment and ordering of the sequences of the invention to other sequences. 
In addition, the computer records regarding the nucleic acid and/or amino acid sequences 
and/or measurements of their levels may be used alone or in combination with other 
markers to diagnose colon related diseases. 

DETAILED DESCRIPTION OF THE INVENTION 
25 Definitions and General Techniques 

Unless othewise defined herein, scientific and technical terms used in connection 
with the present invention shall have the meanings that are commonly understood by those 
of ordinary skill in the art. Further, unless otherwise required by context, singular terms 
shall include pluralities and plural terms shall include the singular. Generally, 
30 nomenclatures used in connection with, and techniques of, cell and tissue culture, 
molecular biology, immunology, microbiology, genetics and protein and nucleic acid 
chemistry and hybridization described herein are those well known and commonly used in 
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the art. The methods and techniques of the present invention are generally performed 
according to conventional methods well known in the art and as described in various 
general and more specific references that are cited and discussed throughout the present 
specification unless otherwise indicated. See, e.g., Sambrook et a/.. Molecular Cloning: A 
5 Laboratory Manual. 2d ed., Cold Spring Harbor Laboratory Press (1989) and Sambrook et 
Molecular Cloning: A Laboratory Manual . 3d ed., Cold Spring Harbor Press (2001); 
Ausubel et ai, Current Protocols in Molecular Biology . Greene Publishing Associates 
(1992, and Supplements to 2000); Ausubel et al, Short Protocols in Molecular Biologv: A 
Compendiu m of Methods from Current Protocols in Molecular Biolog^^ - 4^^ Ed. . Wiley & 
1 0 Sons (1 999); Harlow and Lane, Antibodies: A Laboratory Manual . Cold Spring Harbor 
Laboratory Press (1990); and Harlow and Lane, Using Antibodies: A Laborator\^ Manual . 
Cold Spring Harbor Laboratory Press (1999). 

Enzj^matic reactions and purification techniques are performed according to 
manufacturer's specifications, as commonly accomplished in the art or as described 
1 5 herein. The nomenclatures used in connection with, and the laboratorj' procedures and 
techniques of, analjtical chemistry, synthetic organic chemistry, and medicinal and 
pharmaceutical chemistry described herein are those well known and commonly used in 
the art. Standard techniques are used for chemical syntheses, chemical analyses, 
pharmaceutical preparation, formulation, and delivery, and treatment of patients. 
20 The following temis, unless otherwise indicated, shall be understood to have the 

following meanings: 

A "nucleic acid molecule" of this invention refers to a polymeric form of 
nucleotides and includes both sense and antisense strands of RNA. cDNA, genomic DNA, 
and synthetic forms and mixed polymers of the above. A nucleotide refers to a 
25 ribonucleotide, deoxynucleotide or a modified form of either tj^pe of nucleotide. A 
"nucleic acid molecule" as used herein is synonymous with "nucleic acid" and 
"polynucleotide." The term "nucleic acid molecule" usually refers to a molecule of at 
least 10 bases in length, unless otherwise specified. The term includes single- and double- 
stranded forms of DNA. In addition, a polynucleotide may include either or both naturally 
30 occurring and modified nucleotides linked together by naturally occurring and/or 
non-natural ly occurring nucleotide linkages. 

Nucleotides are represented by single letter sjoiibols in nucleic acid molecule 
sequences. The following table lists sj'mbols identifying nucleotides or groups of 
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nucleotides which may occupy the s>anbol position on a nucleic acid molecule. See 
Nomenclature Committee of the Intemational Union of Biochemistiy (NC-IUB), 



Nomenclature for incompletely specified bases in nucleic acid sequences, 
Recommendations \9S4., Eur J Biochem. 150(l):lo (1985). 



Symbol 


Meaning 


Group/Origin of Designation 


Complementary 

S^jTTlbol 


a 


a 


Adenine 


t/u 


g 


g 


Guanine 


c 


c 


c 


Cytosine 


g 


t 


t 


Thymine 


a 


u 


u 


Uracil 


a 


r 


g or a 


DuRine 


y 


y 


t/u or c 


pYrimidine 


r 


m 


a or c 


aMino 


k 


k 


g or t/u 


Keto 


m 


5 


g or c 


Strong interactions 3H-bonds 


w 


W 


a or t/u 


Weak interactions 2H-bonds 


s 


b 


g or c or t/u 


not a 


V 


d 


a or g or t/u 


not c 


h 


h 


a or c or t/u 


not g 


d 


V 


a or g or c 


not t, not u 


b 


n 


a or g or c 
or t/u, 
unknown, or 
other 


aNy 


n 



The nucleic acid molecules may be modified chemically or biochemically or may 
contain non-natural or derivatized nucleotide bases, as will be readily appreciated by those 
of skill in the art. Such modifications include, for example, labels, methylation, 
substitution of one or more of the naturally occumng nucleotides with an analog, 

10 iniemucleotide modifications such as uncharged linkages {e.g., methyl phosphonaies, 
phosphotriesters, phosphoramidates, carbamates, etc.), charged linkages (e.g., 
phosphorothioates, phosphorodithioates, etc.), pendent moieties (e.g., pol>peptides), 
intercalators {e.g., acridine, psoralen, etc.), chelators, alkylators, and modified linkages 
(e.g, alpha anomeric nucleic acids, etc.) The term "nucleic acid molecule" also includes 

15 any topological conformation, including single-stranded, double-stranded, partially 

duplexed, triplexed, hairpinned, circular and padlocked conformations. Also included are 
synthetic molecules that mimic polynucleotides in their ability to bind to a designated 
sequence via hydrogen bonding and other chemical interactions. Such molecules are 
known in the art and include, for example, those in which peptide linkages substitute for 

20 " phosphate linkages in the backbone of the molecule. 
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A "gene" is defined as a nucleic acid molecule that comprises a nucleic acid 
sequence that encodes a poK^peptide and the expression control sequences that surround 
the nucleic acid sequence that encodes the polypeptide. For instance, a gene may 
comprise a promoter, one or more enhancers, a nucleic acid sequence that encodes a 
5 polypeptide, dowoistream regulatory sequences and, possibly, other nucleic acid sequences 
involved in regulation of the expression of an RNA. As is well kno\\Ti in the art, 
eukaryotic genes usually contain both exons and introns. The term "exon" refers to a 
nucleic acid sequence found in genomic DNA that is bioinformatically predicted and/or 
experimentally confirmed to contribute contiguous sequence to a mature mRNA 

10 transcript. The term "intron" refers to a nucleic acid sequence found in genomic DNA that 
is predicted and/or confirmed to not contribute to a mature mRNA transcript, but rather to 
be ''spliced out" during processing of the transcript. 

A nucleic acid molecule or pohi^eptide is "derived" from a particular species if the 
nucleic acid molecule or poh'peptide has been isolated from the particular species, or if the 

15 nucleic acid molecule or pol>'peptide is homologous to a nucleic acid molecule or 
poljpeptide isolated from a particular species. 

An "isolated" or "substantially pure" nucleic acid or polynucleotide (e.g., an RNA, 
DNA or a mixed polymer) is one which is substantially separated firom other cellular 
components that naturally accompany the native polynucleotide in its natural host cell, 

20 e.g., ribosomes, polymerases, or genomic sequences with which it is naturally associated. 
The term embraces a nucleic acid or polynucleotide that (1) has been removed from its 
naturally occurring environment, (2) is not associated with all or a portion of a 
polynucleotide in which the "isolated polynucleotide" is found in nature, (3) is operatively 
linked to a polynucleotide which it is not linked to in nature, (4) does not occur in nature 

25 as part of a larger sequence or (5) include^ nucleotides or intemucleoside bonds that are 
not found in nature. The term "isolated" or "substantially pure" also can be used in 
reference to recombinant or cloned DNA isolates, chemically synthesized polynucleotide 
analogs, or polynucleotide analogs that are biologically synthesized by heterologous 
systems. The term "isolated nucleic acid molecule" includes nucleic acid molecules that 

30 are integrated into a host cell chromosome at a heterologous site, recombinant fusions of a 
native fragment to a heterologous sequence, recombinant vectors present as episomes or as 
integrated into a host cell cliromosome. 
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A "part'' of a nucleic acid molecule refers to a nucleic acid molecule that 
comprises a partial contiguous sequence of at least 10 bases of the reference nucleic acid 
molecule. Preferably, a part comprises at least 1 5 to 20 bases of a reference nucleic acid 
molecule. In theory, a nucleic acid sequence of 17 nucleotides is of sufficient length to 
5 occur at random less frequently than once in the three gigabase human genome, and thus 
provides a nucleic acid probe that can uniquely identify the reference sequence in a 
nucleic acid mixture of genomic complexit}^ A preferred part is one that comprises a 
nucleic acid sequence that can encode at least 6 contiguous amino acid sequences 
(fragments of at least 18 nucleotides) because they are useful in directing the expression or 

10 synthesis of peptides that are useful in mapping the epitopes of the pol>'peptide encoded 
by the reference nucleic acid. See, e.g., Geysen et al, Proc. Natl. Acad. Sci. USA 
81:3998-4002 (1984); and U.S. Patent Nos. 4,708,871 and 5,595,915, the disclosures of 
which are incorporated herein by reference in their entireties. A part may also comprise at 
least 25, 30, 35 or 40 nucleotides of a reference nucleic acid molecule, or at least 50, 60, 

15 70, 80, 90, 100, 150, 200, 250, 300, 350, 400 or 500 nucleotides of a reference nucleic 
acid molecule. A part of a nucleic acid molecule may comprise no other nucleic acid 
sequences. Alternatively, a part of a nucleic acid may comprise other nucleic acid 
sequences from other nucleic acid molecules. 

The term "oligonucleotide'' refers to a nucleic acid molecule generally comprising 

20 a length of 200 bases or fewer. The term often refers to single-stranded 

deoxyribonucleotides, but it can refer as well to single-or double-stranded ribonucleotides, 
RNA:DNA hybrids and double-stranded DNAs, among others. Preferably, 
oligonucleotides are 10 to 60 bases in length and most preferably 12, 13, 14, 15, 16, 17, 
18, 19 or 20 bases in length. Other preferred oligonucleotides are 25, 30, 35, 40, 45, 50, 

25 55 or 60 bases in length. Oligonucleotides may be single-stranded, e.g. for use as probes 
or primers, or may be double-stranded, e.g. for use in the construction of a mutant gene. 
Oligonucleotides of the invention can be either sense or antisense oligonucleotides. An 
oligonucleotide can be derivatized or modified as discussed above for nucleic acid 
molecules. 

30 Oligonucleotides, such as single-stranded DNA probe oligonucleotides, often are 

synthesized by chemical methods, such as those implemented on automated 
"oligonucleotide synthesizers. However, oligonucleotides can be made by a variety of 
other methods, including in vitro recombinant DNA-mediated techniques and by 
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expression of DNAs in cells and organisms. Initially, chemically synthesized DNAs 
typically are obtained without a 5' phosphate. The 5' ends of such oligonucleotides are 
not substrates for phosphodiester bond formation by ligation reactions that employ DNA 
ligases t>'pical]y used to form recombinant DNA molecules. \\Tiere ligation of such 
5 oligonucleotides is desired, a phosphate can be added by standard techniques, such as 
those that employ a kinase and ATP. The 3' end of a chemically synthesized 
oligonucleotide generally has a free hydroxy! group and, in the presence of a ligase, such 
as T4 DNA ligase, readily will form a phosphodiester bond with a 5' phosphate of another 
pol>Tiucleotide, such as another oligonucleotide. As is well known, this reaction can be 

10 prevented selectively, where desired, by removing the 5' phosphates of the other 
polynucleotide(s) prior to ligation. 

The term "naturally occurring nucleotide'' referred to herein includes naturally 
occurring deoxyribonucleotides and ribonucleotides. The term ''modified nucleotides" 
referred to herein includes nucleotides with modified or substituted sugar groups and the 

15 like. The term "nucleotide linkages" referred to herein includes nucleotide linkages such 
as phosphorothioate, phosphorodithioate, phosphoroselenoate, phosphorodiselenoate, 
phosphoroanilothioate, phoshoraniladate, phosphoroamidate, and the like. See e.g., 
LaPlanche et al Nucl Acids Res. 14:9081-9093 (1986); Stein ef al Nuci Acids Res. 
16:3209-3221 (1988); Zon et ai Anti-Cancer Drug Design 6:539-568 (1991); Zon et al, 

20 in Eckstein (ed.) Oligonucleotides and Analogues: A Practical Approach , pp. 87-108, 
Oxford University Press (1991); Uhlmann and Peyman Chemical Reviews 90:543 (1990), 
and U.S. Patent No. 5,1 51,510, the disclosure of which is hereby incorporated by 
reference in its entiretj'. 

Unless specified otherwise, the left hand end of a polynucleotide sequence in sense 

25 orientation is the 5' end and the right hand end of the sequence is the 3' end. In addition, 
the left hand direction of a polynucleotide sequence in sense orientation is referred to as 
the 5.' direction, while the right hand direction of the polynucleotide sequence is referred 
to as the 3' direction. Further, unless otherwise indicated, each nucleotide sequence is set 
forth herein as a sequence of deoxyribonucleotides. It is intended, however, that the given 

30 sequence be interpreted as would be appropriate to the polynucleotide composition: for 
example, if the isolated nucleic acid is composed of KNA, the given sequence intends 
ribonucleotides, with uridine substituted for thymidine. 
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The term "allelic variant" refers to one of wo or more alternative naturally 
occurring forms of a gene, wherein each gene possesses a unique nucleotide sequence. In 
a preferred embodiment, different alleles of a given gene have similar or identical 
biological properties. 

5 The term "percent sequence identity" in the context of nucleic acid sequences 

refers to the residues in nvo sequences which are the same when aligned for maximum 
correspondence. The length of sequence identity comparison may be over a stretch of at 
least about nine nucleotides, usually at least about 20 nucleotides, more usually at least 
about 24 nucleotides, t>T}ically at least about 28 nucleotides, more typically at least about 

10 32 nucleotides, and preferably at least about 36 or more nucleotides. There are a number 
of different algorithms known in the art which can be used to measure nucleotide sequence 
• identit^^ For instance, pohaiucleotide sequences can be compared using FASTA, Gap or 
Bestfit, which are programs in Wisconsin Package Version 10.0, Genetics Computer 
Group (GCG), iMadison, Wisconsin. FASTA, which includes, e.g., the programs FASTA2 

1 5 and FASTA3, provides alignments and percent sequence identity^ of the regions of the best 
overlap benveen the query and search sequences (Pearson, Methods Enzymol 183: 63-98 
(1990); Pearson, Methods Mol Biol 132: 185-219 (2000); Pearson, Methods Enzymol 
266: 227-258 (1996); Pearson, / Mol Biol 276: 71-84 (1998)). Unless othenvise 
specified, default parameters for a particular program or algorithm are used. For instance, 

20 percent sequence identit>' benveen nucleic acid sequences can be determined using 
FASTA with its default parameters (a word size of 6 and the NOP AM factor for the 
scoring matrix) or using Gap with its default parameters as provided in GCG Version 6.1. 

A reference to a nucleic acid sequence encompasses its complement unless 
otherwise specified. Thus, a reference to a nucleic acid molecule having a particular 

25 sequence should be understood to encompass its complementarj' strand, with its 

complementary sequence. The complementary strand is also useful, e.g., for antisense 
therapy, double-stranded RNA (dsRNA) inhibition (RNAi), combination of triplex and 
antisense, hybridization probes and PGR primers. 

In the molecular biology art, researchers use the terms "percent sequence identity", 

3.0 "percent sequence similarity" and "percent sequence homolog>'" interchangeably. In this 
application, these terms shall have the same meaning with respect to nucleic acid 
sequences only. 
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The term "substantial similarity" or "substantial sequence similarity," when 
referring to a nucleic acid or fragment thereof, indicates that, when optimally aligned with 
appropriate nucleotide insertions or deletions with another nucleic acid (or its 
complementar>' strand), there is nucleotide sequence identity in at least about 50%, more 
5 preferably 60% of the nucleotide bases, usually at least about 70%, more usually at least 
about 80%, preferably at least about 90%, and more preferably at least about 95-98% of 
the nucleotide bases, as measured by any well known algorithm of sequence identity, such 
as FASTA, BLAST or Gap, as discussed above. 

Alternatively, substantial similarit>' exists between a first and second nucleic acid 

10 sequence when the first nucleic acid sequence or fragment thereof hybridizes to an 
antisense strand of the second nucleic acid, under selective hybridization conditions. 
Typically, selective hybridization will occur bet\veen the first nucleic acid sequence and 
an antisense strand of the second nucleic acid sequence when there is at least about 55% 
sequence identity benveen the first and second nucleic acid sequences — preferably at least 

1 5 about 65%, more preferably at least about 75%, and most preferably at least about 90% — 
over a stretch of at least about 14 nucleotides, more preferably at least 17 nucleotides, 
even more preferably at least 20, 25, 30, 35, 40, 50, 60, 70, 80, 90 or 100 nucleotides. 

Nucleic acid hybridization will be affected by such conditions as salt 
concentration, temperature, solvents, the base composition of the hybridizing species, 

20 length of the complementary regions, and the number of nucleotide base mismatches 

between the hybridizing nucleic acids, as will be readily appreciated by those skilled in the 
art. "Stringent hybridization conditions" and "stringent wash conditions" in the context 
of nucleic acid hybridization experiments depend upon a number of different physical 
parameters. The most important parameters include temperature of hybridization, base 

25 composition of the nucleic acids, salt concentration and length of the nucleic acid. One 
having ordinary skill in the art knows how to vary these parameters to achieve a particular 
stringency of hybridization. In general, "stringent hybridization" is performed at about 
25^C below the thermal melting point (Tm) for the specific DNA hybrid under a particular 
set of conditions. "Stringent washing" is performed at temperatures about 5T lower than 

30 the Tm for the specific DNA hybrid under a particular set of conditions. The Tm is the 
temperature at which 50% of the target sequence hybridizes to a perfectly matched probe. 
See Sambrook (1989), supra, p. 9.51 . 

The Tm for a particular DNA-DNA hybrid can be estimated by the formula: 
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T„ = SLS'-C + 16.6 (log,o[Na*]) + 0.41 (fraction G + C) - 
0.63 (% formamide) - (600/1) where 1 is the length of the hybrid in base pairs. 
The Tm for a particular RNA-RNA hybrid can be estimated by the formula: 
T„ = 79.8»C + 18.5 (Iog,o[Na*]) + 0.5S (fraction G + C) + 
1 1 .8 (fraction G + C)' - 0.35 (% formamide) - (S20/1). 
The Tn, for a particular RNA-DNA hybrid can be estimated by the formula: 
T„ = 79.8°C + 18.5(logio[Na^]) + 0.58 (fraction G + C) + 
1 1.8 (fraction G + C)^ - 0.50 (% formamide) - (820/1). 

In general, the T„, decreases by 1-1.5°C for each 1% of mismatch between t^vo 
nucleic acid sequences. Thus, one having ordinary skill in the art can alter hybridization 
and/or washing conditions to obtain sequences that have higher or lower degrees of 
sequence identit>' to the target nucleic acid. For instance, to obtain hybridizing nucleic 
acids that contain up to 10% mismatch from the target nucleic acid sequence, 10-1 5°C 
would be subtracted from the calculated T„ of a perfectly matched hybrid, and then the 
hybridization and washing temperatures adjusted accordingly. Probe sequences may also 
hybridize specifically to duplex DNA under certain conditions to form triplex or other 
higher order DNA complexes. The preparation of such probes and suitable hybridization 
conditions are well known in the art. 

An example of stringent hybridization conditions for hybridization of 
20 complementary nucleic acid sequences having more than 1 00 complementaiy residues on 
a filter in a Southern or Northern blot or for screening a library is 50% fonnamide/6X SSC 
at 42*C for at least ten hours and preferably overnight (approximately 16 hours). Another 
example of stringent hybridization conditions is 6X SSC at 6S°C without formamide for at 
least ten hours and preferably overnight An example of moderate stringency 
25 hybridization conditions is 6X SSC at 55»C without formamide for at least ten hours and 
preferably overnight. An example of low stringency hybridization conditions for 
hybridization of complementary nucleic acid sequences having more than 100 
complementaiy residues on a filter in a Southern or northern blot or for screening a librae 
is 6X SSC at 42°C for at least ten hours. Hybridization conditions to identify nucleic acid 
30 sequences that are similar but not identical can be identified by experimentally changing 
the hybridization temperature from 68°C to 42^ while keeping the salt concentration 
constant (6X SSC), or keeping 'the hybridization temperature and salt concentration 
constant (e.g. Al'C and 6X SSC) and varying the formamide concentration from 50% to 
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0%. Hybridization buffers may also include blocking agents to lower background. These 
agents are well known in the art. See Sambrook et al (1989), supra, pages 8.46 and 9.46- 
9.58. See also Ausubel (1992), supra, Ausubel (1999), supra^ and Sambrook (2001), 
supra, 

5 Wash conditions also can be altered to change stringency conditions. An example 

of stringent wash conditions is a 0.2x SSC wash at 65°C for 15 minutes {see Sambrook 
(1989), supra, for SSC buffer). Often the high stringency wash is preceded by a low 
stringency wash to remove excess probe. An exemplary medium stringency wash for 
duplex DNA of more than 100 base pairs is Ix SSC at 45T for 15 minutes. An 

10 exemplar}' low stringency wash for such a duplex is 4x SSC at 40°C for 15 minutes. In 
general, signal-to-noise ratio of 2x or higher than that obsen'ed for an uru^elated probe in 
the particular hybridization assay indicates detection of a specific hybridization. 

As defined herein, nucleic acids that do not hybridize to each other under stringent 
conditions are still substantially similar to one another if they encode polj'peptides that are 

1 5 substantially identical to each other. This occurs, for example, when a nucleic acid is 
created synthetically or recombinantly using a high codon degeneracy as permitted by the 
redundancy of the genetic code. 

Hybridization conditions for nucleic acid molecules that are shorter than 100 
nucleotides in length (e.g., for oligonucleotide probes) may be calculated by the formula: 

20 Tm = 81.5°C + 16.6(logio[Na*]) + 0.41(fraction G+C) -(600/N), wherein N is 

change length and the [Na"^] is 1 M or less. See Sambrook (1989), supra, p. 1 1.46. For 
hybridization of probes shorter than 100 nucleotides, hybridization is usually performed 
under stringent conditions (5-lOT below the Tm) using high concentrations (0.1-1.0 
pmol/ml) of probe. Id. at p. 11.45. Determination of hybridization using mismatched 

25 probes, pools of degenerate probes or "guessmers," as well as hybridization solutions and 
methods for empirically determining hybridization conditions are well known in the art. 
See, e,g., Ausubel (1999), supra; Sambrook (1989), supra, pp. 1 1.45-1 1.57. 

The term "digestion" or "digestion of DNA" refers to catalytic cleavage of the 
DNA with a restriction enzyme that acts only at certain sequences in the DNA. The 

30 various restriction enzymes referred to herein are commercially available and their 

reaction conditions, cofactors and other requirements for use are known and routine to the 
skilled artisan. For analytical purposes, typically, 1 pg of plasmid or DNA fragment is 
digested with about 2 units of enzyme in about 20 yl of reaction buffer. For the purpose of 
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isolating DNA fragments for plasmid construction, t>'pically 5 to 50 yg of DNA are 
digested with 20 to 250 units of enzyme in proportionately larger volumes. Appropriate 
buffers and substrate amounts for particular restriction enzymes are described in standard 
laboratory manuals, such as those referenced below, and are specified by commercial 
5 suppliers. Incubation times of about 1 hour at 37*C are ordinarily used, but conditions 
may varj' in accordance with standard procedures, the supplier's instructions and the 
particulars of the reaction. After digestion, reactions may be analyzed, and fragments may 
be purified by electrophoresis through an agarose or polyacr>1amide gel, using well 
known methods that are routine for those skilled in the art. 

10 The term "ligation" refers to the process of forming phosphodiester bonds between 

two or more polynucleotides, which most often are double-stranded DNAs. Techniques 
for ligation are well known to the art and protocols for ligation are described in standard 
laborator>^ manuals and references, such as, e.g., Sambrook (1989), supra. 

Genome-derived "single exon probes," are probes that comprise at least part of an 

15 exon ("reference exon") and can hybridize detectably under high stringency conditions to 
transcript-derived nucleic acids that include the reference exon but do not hybridize 
detectably under high sti'ingency conditions to nucleic acids that lack the reference exon. 
Single exon probes t>TDically further comprise, contiguous to a first end of the exon 
portion, a first intronic and/or intergenic sequence that is identically contiguous to the 

20 exon in the genome, and may contain a second intronic and/or intergenic sequence that is 
identically contiguous to the exon in the genome. The minimum length of genome- 
derived single exon probes is defined by the requirement that the exonic portion be of 
sufficient length to hybridize under high stringency conditions to transcript-derived 
nucleic acids, as discussed above. The maximum length of genome-derived single exon 

25 probes is defined by the requirement that the probes contain portions of no more than one 
exon. The single exon probes may contain priming sequences not found in contiguity with 
the rest of the probe sequence in the genome, which priming sequences are useful for PGR 
and other amplification-based technologies. In another aspect, the invention is directed to 
single exon probes based on the CSNAs disclosed herein. 

30 In one embodiment, the term "microarray" refers to a "nucleic acid microarray" 

having a substrate-bound plurality of nucleic acids, hybridization to each of the plurality 
^ of bound nucleic acids being separately detectable. The substrate can be solid or porous, 
planar or non-planar, unitary or distributed. Nucleic acid microarrays include all the 
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devices so called in Schena (ed.), DNA Microarravs: A Practical Approach fPractical 
Approach Series ^ Oxford University Press (1999); Nature Genet 21(l)(suppl.):l - 60 

(1999) ; Schena (ed.), Microarrav Biochip: Tools and Technoloev , Eaton Publishing 
Company/BioTechniques Books Division (2000). Additionally, these nucleic acid 

5 microanays include a substrate-bound pluralit>' of nucleic acids in which the pluralitj' of 
nucleic acids are disposed on a pluralit}' of beads, rather than on a unitar>' planar substrate, 
as is described, inter alia, in Brenner et al.Proc, Natl Acad. Set, USA 97(4):1665-1670 

(2000) . Examples of nucleic acid microarrays may be found in U.S. Patent Nos. 
6,391,623, 6,383,754, 6,383,749, 6,380,377, 6,379,897, 6,376,191, 6,372,431, 6,351,712 

10 6,344,316, 6,316,193, 6,312,906, 6,309,828, 6,309,824, 6,306,643, 6,300,063, 6,287,850, 
6,284,497, 6,284,465, 6,280,954, 6,262,216, 6,251,601, 6,245,518, 6,263,287, 6,251,601, 
6,238,866, 6,228,575, 6,214,587, 6,203,989, 6,171,797, 6,103,474, 6,083,726, 6,054,274, 
6,040,138, 6,083.726, 6,004,755, 6,001,309, 5,958,342, 5,952,180, 5,936,731, 5,843,655, 
5,814,454, 5,837,196, 5,436,327, 5,412,087, and 5,405,783, the disclosures of which are 

1 5 incorporated herein by reference in their entireties. 

In an alternative embodiment, a "microarray" may also refer to a "peptide 
microarrav" or "protein microarray" ha\'ing a substrate-bound collection or plurality of 
polypeptides, the binding to each of the plurality of bound polypeptides being separately 
detectable. Alternatively, the peptide microarray may have a plurality of binders, 

20 including but not limited to monoclonal antibodies, polyclonal antibodies, phage display 
binders, yeast 2 hybrid binders, and aptamers, which can specifically detect the binding of 
the polj'peptides of this invention. The array may be based on autoantibody detection to 
the poljipeptides of this invention, see Robinson et a/., Nature Medicine 8(3):295-301 
(2002). Examples of peptide arrays may be found in WO 02/31463, WO 02/25288, WO 

25 01/94946, WO 01/88162, WO 01/68671, WO 01/57259, WO 00/61806, WO 00/54046, 
WO 00/47774, WO 99/40434, WO 99/39210, and WO 97/42507 and U.S. Patent Nos. 
6,268,210, 5,766,960, and 5,143,854, the disclosures of which are incorporated herein by 
reference in their entireties. 

In addition, determination of the levels of the CSNA or CSP may be made in a 

30 multiplex manner using techniques described in WO 02/29109^ WO 02/24959, WO 
01/83502, WOOl/731 13, WO 01/59432, WO 01/57269, and WO 99/67641, the 
disclosures of which are incorporated herein by reference in their entireties. 
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The term "mutant", "mutated", or "mutation" when applied to nucleic acid 
sequences means that nucleotides in a nucleic acid sequence may be inserted, deleted or 
changed compared to a reference nucleic acid sequence. A single alteration may be made 
at a locus (a point mutation) or multiple nucleotides may be inserted, deleted or changed at 
5 a single locus. In addition, one or more alterations may be made at any number of loci 
within a nucleic acid sequence. In a preferred embodiment of the present invention, the 
nucleic acid sequence is the wild t>T>e nucleic acid sequence encoding a CSP or is a 
CSNA. The nucleic acid sequence may be mutated by any method known in the art 
including those mutagenesis techniques described infra. 
10 The term "error-prone PGR" refers to a process for performing PGR under 

conditions where the copying fidelit>' of the DNA polymerase is low, such that a high rate 
of point mutations is obtained along the entire length of the PGR product. See, e.g., Leung 
e! aL, Technique 1:11-15 (19S9) and Galdwell et al, PGR Methods Applic. 2: 28-33 
(1992). 

15 The term "oligonucleotide-directed mutagenesis" refers to a process which enables 

the generation of site-specific mutations in any cloned DNA segment of interest. See, e.g., 

Reidhaar-OIson et al, Science 241: 53-57 (1988). 

The term "assembly PGR" refers to a process which involves the assembly of a 

PGR product from a mixture of small DNA fragments. A large number of different PGR 
20 reactions occur in parallel in the same vial, with the products of one reaction priming the 

products of another reaction. 

The term "sexual PGR mutagenesis" or "DNA shuffling" refers to a method of 

en'or-prone PGR coupled with forced homologous recombination beUveen DNA 

molecules of different but highly related DNA sequence in vitro, caused by random 
25 fragmentation of the DNA molecule based on sequence similarity, followed by fixation of 

the crossover by primer extension in an error-prone PGR reaction. See, e.g., Stemmer, 

Proc. Natl Acad. Sci. U.S.A. 91: 10747-10751 (1994). DNA shuffling can be carried out 

benveen several related genes ("Family shuffling"). 

The term "/;; vivo mutagenesis" refers to a process of generating random mutations 
30 in any cloned DNA of interest which involves the propagation of the DNA in a strain of 

bacteria such as E. coli that cairies mutations in one or more of the DNA repair pathways. 

These "mutator" strains have a higher random mutation rate than that of a wild-type * 
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parent. Propagating the DNA in a mutator strain will eventually generate random 
mutations within the DNA. 

The term "cassette mutagenesis" refers to any process for replacing a small region 
of a double-stranded DNA molecule with a synthetic oligonucleotide "cassette" that 
5 differs from the native sequence. The oligonucleotide often contains completely and/or 
partially randomized native sequence. 

The term "recursive ensemble mutagenesis'- refers to an algorithm for protein 
engineering (protein mutagenesis) developed to produce diverse populations of 
phenotypically related mutants whose members differ in amino acid sequence. This 
10 method uses a feedback mechanism to control successive rounds of combinatorial cassette 
mutagenesis. See, e.g., Arkin e( al, Proc. Natl. Acad. Sci. U.S.A. 89: 781 1-7815 (1992). 

The term "exponential ensemble mutagenesis" refers to a process for generating 
combinatorial libraries with a high percentage of unique and functional mutants, wherein 
small groups of residues are randomized in parallel to identify, at each altered position, 
1 5 amino acids which lead to functional proteins. See, e.g. , Delegrave et al. , Biotechnology^ 
Research 11: 1548-1552 (1993); Arnold, Current Opinion in Biotechnology^ A: 450-455 
(1993). 

"Operatively linked" expression control sequences refers to a linkage in which the 
expression control sequence is either contiguous with the gene of interest to control the 

20 gene of interest, or acts in trans or at a distance to control the gene of interest. 

The term "expression control sequence" as used herein refers to polynucleotide 
sequences which are necessar>' to affect the expression of coding sequences to which they 
are operatively linked. Expression control sequences are sequences which control the 
transcription, post-transcriptional events and translation of nucleic acid sequences. 

25 Expression control sequences include appropriate transcription initiation, termination, 
promoter and enhancer sequences; efficient RNA processing signals such as splicing and 
polyadenylation signals; sequences that stabilize c>toplasmic mRNA; sequences that 
enhance translation efficiency {e.g.^ ribosome binding sites); sequences that enhance 
protein stability; and when desired, sequences that enhance protein secretion. The nature 

30 of such control sequences differs depending upon the host organism; in prokaryotes, such 
control sequences generally include promoter, ribosomal binding site, and transcription 
termination sequence. The term "control sequences" is intended to include, at a minimum, 
all components whose presence is essential for expression, and can also include additional 
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components whose presence is advantageous, for example, leader sequences and fiision 
partner sequences. 

The term "vector," as used herein, is intended to refer to a nucleic acid molecule 
capable of transporting another nucleic acid to which it has been linked. One type of 
5 vector is a "plasmid*', which refers to a circular double-stranded DNA loop into which 
additional DNA segments may be ligated. Other vectors include cosmids, bacterial 
artificial chromosomes (BAG) and yeast artificial chromosomes (YAC). Another type of 
vector is a viral vector, wherein additional DNA segments may be ligated into the viral 
genome. Viral vectors that infect bacterial cells are referred to as bacteriophages. Certain 

10 vectors are capable of autonomous replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of replication). Other vectors can be 
integrated into the genome of a host cell upon introduction into the host cell and thereby 
are replicated along with the host genome. Moreover, certain vectors are capable of 
directing the expression of genes to which they are operatively linked. Such vectors are 

15 referred to herein as "recombinant expression vectors'* (or simply, "expression vectors"). 
In general, expression vectors of utilit>' in recombinant DNA techniques are often in the 
form of plasmids. In the present specification, "plasmid" and "vector" may be used 
interchangeably as the plasmid is the most commonly used form of vector. However, the 
invention is intended to include other forms of expression vectors that serve equivalent 

20 functions. 

The term "recombinant host cell" (or simply "host cell"), as used herein, is 

intended to refer to a cell into which a recombinant expression vector has been introduced. 

It should be understood that such terms are intended to refer not only to the particular 

subject cell but also to the progeny of such a cell. Because certain modifications may 
25 occur in succeeding generations due to either mutation or environmental influences, such 

progeny may not, in fact, be identical to the parent cell, but are still included within the 

scope of the term "host cell" as used herein. 

As used herein, the phrase "open reading frame" and the equivalent acronym 

"ORF* refers to that portion of a transcript-derived nucleic acid that can be translated in its 
30 entirety into a sequence of contiguous amino acids. As so defined, an ORF has length, 

measured in nucleotides, exactly divisible by 3. As so defined, an ORF need not encode 

the entirety of a natural protein. 
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As used herein, the phrase "ORF-encoded peptide" refers to the predicted or actual 
translation of an ORF. 

As used herein, the phrase "degenerate variant" of a reference nucleic acid 
sequence is meant to be inclusive of all nucleic acid sequences that can be directly 
5 translated, using the standard genetic code, to provide an amino acid sequence identical to 
that translated from the reference nucleic acid sequence. 

The term "polypeptide" encompasses both naturally occurring and non-naturally 
occurring proteins and polypeptides, as well as polypeptide fragments and poI>'peptide 
mutants, derivatives and analogs thereof A polypeptide may be monomeric or polymeric. 

10 Further, a polypeptide may comprise a number of different modules within a single 

polj^peptide each of which has one or more distinct activities. A preferred polypeptide in 
accordance with the invention comprises a CSP encoded by a nucleic acid molecule of the 
instant invention, or a fragment, mutant, analog or derivative thereof. 

The term "isolated protein" or "isolated polypeptide" is a protein or polypeptide 

1 5 that by virtue of its origin or source of derivation (1 ) is not associated with naturally 
associated components that accompany it in its native state, (2) is free of other proteins 
from the same species (3) is expressed by a cell from a different species, or (4) does not 
occur in nature. Thus, a pol>'peplide that is chemically synthesized or synthesized in a 
cellular system different from the cell from which it naturally originates will be "isolated" 

20 from its naturally associated components. A polypeptide or protein may also be rendered 
substantially free of naturally associated components by isolation, using protein 
purification techniques well known in the art. 

A protein or polypeptide is "substantially pure," "substantially homogeneous" or 
"substantially purified" when at least about 60% to 75% of a sample exhibits a single 

25 species of polypeptide. The pol^TDeptide or protein may be monomeric or multimeric. A 
substantially pure polypeptide or protein will typically comprise about 50%, 60%, 70%, 
80?^ or 90% WAV of a protein sample, more usually about 95%, and preferably will be 
over 99% pure. Protein purity or homogeneitj' may be detennined by a number of means 
well known in the art, such as polyaciylamide gel electrophoresis of a protein sample, 

30 followed by visualizing a single polypeptide band upon staining the gel with a stain well 
known in the art. For certain purposes, higher resolution may be provided by using HPLC 
or other means well known in the art for purification. 
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The term "fragment" when used herein with respect to polypeptides of the present 
invention refers to a polypeptide that has an amino-terminal and/or carboxy-terminal 
deletion compared to a full-length CSP. In a preferred embodiment, the fragment is a 
contiguous sequence in which the amino acid sequence of the fragment is identical to the 
5 corresponding positions in the naturally occurring polypeptide. Fragments tj^pically are at 
least 5, 6, 7, 8, 9 or 10 amino acids long, preferably at least 12, 14, 16 or 18 amino acids 
long, more preferably at least 20 amino acids long, more preferably at least 25, 30, 35, 40 
or 45, amino acids, even more preferably at least 50 or 60 amino acids long, and even 
more preferably at least 70 amino acids long. 

10 A "derivative" when used herein with respect to polypeptides of the present 

invention refers to a polypeptide which is substantially similar in primary structural 
sequence to a CSP but which includes, e.g., in vivo or in vitro chemical and biochemical 
modifications that are not found in the CSP. Such modifications include, for example, 
acetylation, acylation, ADP-ribosylation, amidation, covalent attachment of flavin, 

15 covalent attachment of a heme moiet>', covalent attachment of a nucleotide or nucleotide 
derivative, covalent attachment of a lipid or lipid derivative, covalent attachment of 
phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, demethylation, 
formation of covalent cross-links, formation of cystine, formation of p>Toglutamate, 
formylation, gamma-carboxylation, glycosylation, GPI anchor formation, hydroxylation, 

20 iodination, methylation, myristoylation, oxidation, proteohlic processing, 

phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA 
mediated addition of amino acids to proteins such as arginylation, and ubiquitination. 
Other modifications include, €,g,, labeling with radionuclides, and various enzymatic 
modifications, as will be readily appreciated by those skilled in the art. A variety of 

25 methods for labeling polypeptides and of substituents or labels useful for such purposes 
are well known in the art, and include radioactive isotopes such as '"^I, ^'P, ^^S, '''C and 
^H, ligands which bind to labeled antiligands (e.g., antibodies), fluorophores, 
chemiluminescent agents, enz>Tnes, and antiligands which can sen'e as specific binding 
pair members for a labeled ligand. The choice of label depends on the sensitivity required, 

30 ease of conjugation with the primer, stability requirements, and available instrumentation. 
Methods for labeling poljT^eptides are well known in the art. See Ausubel (1992), supra; 
Ausuh&l (\999), supra. 
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The term "fusion protein" refers to polypeptides of the present invention coupled 
to a heterologous amino acid sequence. Fusion proteins are useful because they can be 
constructed to contain ^vo or more desired functional elements from nvo or more different 
proteins. A fusion protein comprises at least 10 contiguous amino acids from a 
5 polj^peptide of interest, more preferably at least 20 or 30 amino acids, even more 

preferably at least 40, 50 or 60 amino acids, yet more preferably at least 75, 100 or 125 
amino acids. Fusion proteins can be produced recombinantly by constructing a nucleic 
acid sequence that encodes the polypeptide or a fragment thereof in frame with a nucleic 
acid sequence encoding a different protein or peptide and then expressing the fusion 

10 protein. Alternatively, a fusion protein can be produced chemically by crosslinking the 
pol>peptide or a fragment thereof to another protein. 

The term "analog" refers to both pohpeptide analogs and non-peptide analogs. 
The term "polypeptide analog'' as used herein refers to a polypeptide that is comprised of a 
segment of at least 25 amino acids that has substantial identity to a portion of an amino 

15 acid sequence but which contains non-natural amino acids or non-natural inter-residue 
bonds. In a preferred embodiment, the analog has the same or similar biological activity 
as the native polypeptide. Tjpically, polypeptide analogs comprise a conservative amino 
acid substitution (or insertion or deletion) with respect to the naturally occurring sequence. 
Analogs t>'pically are at least 20 amino acids long, preferably at least 50 amino acids long 

20 or longer, and can often be as long as a full-length naturally occurring polypeptide. 

The term "non-peptide analog" refers to a compound with properties that are 
analogous to those of a reference polypeptide. A non-peptide compound may also be 
termed a "peptide mimetic" or a "peptidomimetic." Such compounds are often developed 
with the aid of computerized molecular modeling. Peptide mimetics that are structurally 

25 similar to useful peptides may be used to produce an equivalent effect. Generally, 

peptidomimetics are structurally similar to a paradigm polypeptide (/.^., a polj^eptide that 
has a desired biochemical property or pharmacological activity), but have one or more 
peptide linkages optionally replaced by a linkage selected from the group consisting of: 
-CH2NH", --CH2S", -CH2-CH2-, "CH=CH"(cis and trans). "COCH2-, 

30 "CH(0H)CH2-, and -CH2SO", by methods well known in the art. Systematic 

substitution of one or more amino acids of a consensus sequence with a D-amino acid of 
the same type (e.g., D-lysine in place of L-lysine) may also be used to generate more 
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stable peptides. In addition, constrained peptides comprising a consensus sequence or a 
substantially identical consensus sequence variation may be generated by methods known 
in the art (Rizo et al, Awl Rev, Biochem, 61:387-418 (1992)). For example, one may add 
internal cysteine residues capable of forming intramolecular disulfide bridges which 
5 cyclize the peptide. 

The term "mutant" or *'mutein" when referring to a polypeptide of the present 
invention relates to an amino acid sequence containing substioitions, insertions or 
deletions of one or more amino acids compared to the amino acid sequence of a CSP. A 
mutein may have one or more amino acid point substitutions, in which a single amino acid 

10 at a position has been changed to another amino acid, one or more insertions and/or 
deletions, in which one or more amino acids are inserted or deleted, respectively, in the 
sequence of the naturally occurring protein, and/or truncations of the amino acid sequence 
at either or both the amino or carboxy termini. Further, a mutein may have the same or 
different biological activit}^ as the naturally occurring protein. For instance, a mutein may 

1 5 have an increased or decreased biological activity. A mutein has at least 50% sequence 
similarit>' to the wild type protein, preferred is 60% sequence similarity, more preferred is 
70% sequence similarity. Even more preferred are muteins having S0%, 85% or 90% 
sequence similarit>' to a CSP. In an even more preferred embodiment, a mutein exhibits 
95% sequence identity, even more preferably 97%. even more preferably 98% and even 

20 more preferably 99%. Sequence similarity may be measured by any common sequence 
analysis algorithm, such as GAP or BESTFIT or other variation Smith-Wateman 
alignment See, T. F. Smith and M. S. Waterman, J. Mol. Biol. 147:195-197 (1981) and 
W.R. Pearson, Genomics 11:635-650 (1991). 

Preferred amino acid substitutions are those which: (1) reduce susceptibility to 

25 proteolysis, (2) reduce susceptibility to oxidation, (3) alter binding affinity for forming 
protein complexes, (4) alter binding affinity or enzymatic activity, and (5) confer or 
modify other physicochemical or functional properties of such analogs. For example, 
single or multiple amino acid substitutions (preferably conservative amino acid 
substitutions) may be made in the naturally occurring sequence (preferably in the portion 

30 of the polypeptide outside the domain(s) forming intermolecular contacts. In a preferred 
embodiment, the amino acid substitutions are moderately conservative substitutions or 
conser\'ative substitutions. In a more prefened embodiment, the amino acid substitutions 
are conser\'ative substitutions. A conservative amino acid substitution should not 
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substantially change the structural characteristics of the parent sequence (e.g., a 
replacement amino acid should not tend to disrupt a helix that occurs in the parent 
sequence, or disrupt other types of secondarj' structure that characterize the parent 
sequence). Examples of art-recognized polypeptide secondary and tertiar>' structures are 
5 described in Creighton (ed.), Proteins. Structures and Molecular Principles . \V. H. 
Freeman and Company (1984); Branden et al (ed.), Introduction to Protein Structure , 
Garland Publishing (1991); Thornton et al, Nature 354:105-106 (1991). 

As used herein, the nvent>' conventional amino acids and their abbreviations follow 
conventional usage. See Golub et al (eds.), Immunology - A Synthesis 2""^ Ed., Sinauer 
10 Associates (1991). Stereoisomers {e.g., D-amino acids) of the tvr^ent>' conventional amino 
acids, unnatural amino acids such as a-, a-di substituted amino acids. N-alkyl amino acids, 
and other unconventional amino acids may also be suitable components for polypeptides 
of the present invention. Examples of unconventional amino acids include: 

4- hydroxyproline, ycarboxyglutamate, £:-N,N,N-trimethyllysine, e-N-acet>'llysine, 
1 5 0-phosphoserine, N-acet>'lserine, N-formylmethionine, 3-methylhistidine, 

5- hydroxylysine, s-N-methylarginine, and other similar amino acids and imino acids (eg., 
4-hydroxyproline). In the polypeptide notation used herein, the lefthand direction is the 
amino terminal direction and the right hand direction is the carboxy-terminal direction, in 
accordance vi^ith standard usage and convention. 

20 By "homology" or "homologous" when referring to a polypeptide of the present 

invention it is meant polypeptides from different organisms with a similar sequence to 
the encoded amino acid sequence of a CSP and a similar biological activity or function. 
Although two polypeptides are said to be "homologous," this does not imply that there is 
necessarily an evolutionary relationship between the polypeptides. Instead, the term 

25 "homologous" is defined to mean that the ^vo polypeptides have similar amino acid 
sequences and similar biological activities or functions. In a preferred embodiment, a 
homologous polypeptide is one that exhibits 50% sequence similarity to CSP, preferred is 
60% sequence similarity, more preferred is 70% sequence similarity. Even more preferred 
are homologous polypeptides that exhibit 80%, 85% or 90% sequence similarity to a CSP. 

30 In yet a more preferred embodiment, a homologous polypeptide exhibits 95%, 97%, 98% 
or 99% sequence similarity. 

When "sequence similarity" is used in reference to polypeptides, it is recognized 
that residue positions that are not identical often differ by conservative amino acid 
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substitutions. In a preferred embodiment, a polypeptide that has "sequence similarity" 
comprises conservative or moderately conservative amino acid substitutions. A 
"conservative amino acid substitution" is one in which an amino acid residue is substituted 
by another amino acid residue having a side chain (R group) with similar chemical 
5 properties (e.g., charge or hydrophobicity). In general, a conservative amino acid 

substitution will not substantially change the functional properties of a protein. In cases 
where nvo or more amino acid sequences differ from each other by conservative 
substitutions, the percent sequence identit>' or degree of similarit>' may be adjusted 
upwards to correct for the conser\'ative nature of the substitution. Means for making this 
10 adjustment are well known to those of skill in the art. See^ e.g., Pearson, Methods Mol, 
Biol 24:307-31 (1994). 

For instance, the following six groups each contain amino acids that are 
conser\'ative substitutions for one another: 

1 ) Serine (S), Threonine (T); 
1 5 2) Aspartic Acid (D), Glutamic Acid (E); 

3) Asparagine (N), Glutamine (Q); 

4) Arginine (R), Lysine (K); 

5) Isoleucine (I), Leucine (L), Methionine (M), Alanine (A), Valine (V), and 

6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W). 

20 Alternatively, a conservative replacement is any change having a positive value in 

the PAM250 log-likelihood matrix disclosed in Gonnet et ai. Science 256: 1443-45 
(1 992). A ''moderately conservative'' replacement is any change having a nonnegative 
value in the PAM250 log-likelihood matrix. 

Sequence similarity for polypeptides, which is also referred to as sequence 

25 identity, is typically measured using sequence analysis software. Protein analysis sofnvare 
matches similar sequences using measures of similarity assigned to various substitutions, 
deletions and other modifications, including conservative amino acid substitutions. For 
instance, GCG contains programs such as "Gap" and "Bestfit" which can be used with 
default parameters to determine sequence homology or sequence identit)' benveen closely 

30 related polypeptides, such as homologous pol>'peptides from different species of 

organisms or benveen a wild type protein and a mutein thereof. See, e,g., GCG Version 
6. 1 . Other programs include FASTA, discussed supra. 
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A preferred algorithm when comparing a sequence of the invention to a database 
containing a large number of sequences from different organisms is the computer program 
BLAST, especially blastp or tblastn. See, e.g., Altschul et al, J. Mol Biol 215: 403-410 
(1990); Altschul et al, Nucleic Acids Res, 25:3389-402 (1997). Preferred parameters for 
5 blastp are: 

Expectation value: 1 0 (default) 

Filter: seg (default) 

Cost to open a gap: 11 (default) 

Cost to extend a gap: 1 (default 
10 Max. alignments: 100 (default) 

Word size: 1 1 (default) 

No. of descriptions: 100 (default) 

Penalt>' Matrix: BLOSUM62 

The length of polypeptide sequences compared for homology will generally be at 

15 least about 16 amino acid residues, usually at least about 20 residues, more usually at least 
about 24 residues, typically at least about 28 residues, and preferably more than about 35 
residues. WTien searching a database containing sequences from a large number of 
different organisms, it is preferable to compare amino acid sequences. 

Algorithms other than blastp for database searching using amino acid sequences 

20 are known in the art. For instance, pol>T)eptide sequences can be compared using FASTA, 
a program in GCG Version 6.1 . FASTA {e.g., FASTA2 and FASTA3) provides 
alignments and percent sequence identity of the regions of the best overlap between the 
query and search sequences (Pearson (1 990), supra\ Pearson (2000), supra. For example, 
percent sequence identity beUveen amino acid sequences can be determined using FASTA 

25 with its default or recommended parameters (a word size of 2 and the PAM250 scoring 
matrix), as provided in GCG Version 6.1. 

An "antibody" refers to an intact immunoglobulin, or to an antigen-binding portion 
thereof that competes with the intact antibody for specific binding to a molecular species, 
e.^., a polypeptide of the instant invention. Antigen-binding portions may be produced by 

30 recombinant DNA techniques or by enzymatic or chemical cleavage of intact antibodies. 
Antigen-binding portions include, inter alia, Fab, Fab', F(ab')2, Fv, dAb, and 
complementarity determining region (CDR) fragments, single-chain antibodies (scFv), 
chimeric antibodies, diabodies and pol>'peptides that contain at least a portion of an 
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immunoglobulin that is sufficient to confer specific antigen binding to the polypeptide. A 
Fab fragment is a monovalent fragment consisting of the VL, VH, CL and CHI domains; 
a F(ab')2 fragment is a bivalent fragment comprising two Fab fragments linked by a 
disulfide bridge at the hinge region; a Fd fragment consists of the VH and CHI domains; a 
5 Fv fragment consists of the VL and \Ti domains of a single arm of an antibody; and a dAb 
fragment consists of a VH domain. See, e.g., Ward et al. Nature 341: 544-546 (19S9). 

By "bind specifically" and "specific binding" as used herein it is meant the ability 
of the antibody to bind to a first molecular species in preference to binding to other 
molecular species with which the antibody and first molecular species are admixed. An 

10 antibody is said to "recognize" a first molecular species when it can bind specifically to 
that first molecular species, 

A single-chain antibody (scFv) is an antibody in which VL and VH regions are 
paired to form a monovalent molecule via a synthetic linker that enables them to be made 
as a single protein chain. See, e.g.. Bird et al, Science 242: 423-426 (1988); Huston et al, 

15 Proc, Natl Acad. ScL USA 85: 5879-5883 (1988). Diabodies are bivalent, bispecific 

antibodies in which VH and VL domains are expressed on a single polypeptide chain, but 
using a linker that is too short to allow for pairing benveen the two domains on the same 
chain, thereby forcing the domains to pair with complementary domains of another chain 
and creating two antigen binding sites. See e.g., Holliger et ai.Proc. Natl Acad, Sci. USA 

20 90: 6444-6448 (1993); Poljak et al, Sti-ucture 2: 1 121-1 123 (1994). One or more CDRs 
may be incorporated into a molecule either covalently or noncovalently to make it an 
immunoadhesin. An immunoadhesin may incorporate the CDR(s) as part of a larger 
polypeptide chain, may covalently link the CDR(s) to another polypeptide chain, or may 
incorporate the CDR(s) noncovalently. The CDRs permit the immunoadhesin to 

25 specifically bind to a particular antigen of interest. A chimeric antibody is an antibody 
that contains one or more regions from one antibody and one or more regions from one or 
more other antibodies. 

An antibody may have one or more binding sites. If there is more than one binding 
site, the binding sites may be identical to one another or may be different. For instance, a 

30 naturally occurring immunoglobulin has two identical binding sites, a single-chain 
antibody or Fab fragment has one binding site, while a "bispecific" or "bifunctional" 
' antibody has nvo different binding sites. 
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An 'Isolated antibody" is an antibody that (1) is not associated with naturally- 
associated components, including other naturally-associated antibodies, that accompany it 
in its native state, (2) is free of other proteins from the same species, (3) is expressed by a 
cell from a different species, or (4) does not occur in nature. It is known that purified 
5 proteins, including purified antibodies, may be stabilized with non-naturally-associated 
components. The non-naturally-associated component may be a protein, such as albumin 
(e.g., BSA) or a chemical such as polyethylene glycol (PEG). 

A '^neutralizing antibody" or "an inhibitory antibody" is an antibody that inhibits 
the activity of a pol^^^eptide or blocks the binding of a polypeptide to a ligand that 
10 normally binds to it. An "activating antibody" is an antibody that increases the activit>' of 
a polypeptide. 

The term "epitope" includes any protein determinant capable of specific binding to 
an immunoglobulin or T-cell receptor. Epitopic determinants usually consist of 
chemically active surface groupings of molecules such as amino acids or sugar side chains 

1 5 and usually have specific three-dimensional structural characteristics, as well as specific 
charge characteristics. An antibody is said to specifically bind an antigen when the 
dissociation constant is less than! pM, preferably less than 100 nNl and most preferably 
less than 10 nM. 

The term "patient" includes human and veterinary subjects. 

20 Throughout this specification and claims, the word "comprise," or variations such 

as "comprises" or "comprising," will be understood to imply the inclusion of a stated 
integer or group of integers but not the exclusion of any other integer or group of integers. 

The term "colon specific" refers to a nucleic acid molecule or polypeptide that is 
expressed predominantly in the colon as compared to other tissues in the body. In a 

25 preferred embodiment, a "colon specific" nucleic acid molecule or polypeptide is detected 
at a level that is 1 .5-fold higher than any other tissue in the body. In a more preferred 
embodiment, the "colon specific" nucleic acid molecule or polypeptide is detected at a 
level that is 2-fold higher than any other tissue in the body, more preferably 5-fold higher, 
still more preferably at least 10-fold, 15-fold, 20-fold, 25-fold, 50-fold or 100-fold higher 

30 than any other tissue in the body. Nucleic acid molecule levels may be measured by 
nucleic acid hybridization, such as Northern blot hybridization, or quantitative PCR. 
* Polypeptide levels may be measured by any method known to accurately quantitate 
protein levels, such as Western blot analysis. 
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Nucleic Acid Molecules. Regulaton^ Sequences> Vectors. Host Cells and Recombinant 
Methods of Making Poh^eptides 
Nucleic Acid Molecules 

One aspect of the invention provides isolated nucleic acid molecules that are 
5 specific to the colon or to colon cells or tissue or that are derived from such nucleic acid 
molecules. These isolated colon specific nucleic acids (CSNAs) may comprise cDNA 
genomic DNA, RNA, or a combination thereof, a fragment of one of these nucleic acids, 
or may be a non-naturally occuiring nucleic acid molecule. A CSNA may be derived fi:om 
an animal. In a preferred embodiment, the CSNA is derived from a human or other 

10 mammal. In a more preferred embodiment, the CSNA is derived from a human or other 
primate. In an even more preferred embodiment, the CSNA is derived from a human. 

In a preferred embodiment, the nucleic acid molecule encodes a polypeptide that 
is specific to colon, a colon-specific polypeptide (CSP). In a more preferred embodiment, 
the nucleic acid molecule encodes a polypeptide that comprises an amino acid sequence of 

15 SEQ ID NO: 96-237. In another highly preferred embodiment, the nucleic acid molecule 
comprises a nucleic acid sequence of SEQ ID NO: 1-95. Nucleotide sequences of the 
instantly-described nucleic acid molecules were determined by assembling several DNA 
molecules from either public or proprietary databases. Some of the underlying DNA 
sequences are the result, directly or indirectly, of at least one enzymatic polymerization 

20 reaction (e.g., reverse transcription and/or polymerase chain reaction) using an automated 
sequencer (such as the MegaBACE"^^* 1000, Amersham Biosciences, Sunnj'vale, CA, 
USA). 

Nucleic acid molecules of the present invention may also comprise sequences that 
selectively hybridize to a nucleic acid molecule encoding a CSNA or a complement or 

25 antisense thereof. The hybridizing nucleic acid molecule may or may not encode a 

polypeptide or may or may not encode a CSP. However, in a preferred embodiment, the 
hybridizing nucleic acid molecule encodes a CSP. In a more preferred embodiment, the 
invention provides a nucleic acid molecule that selectively hybridizes to a nucleic acid 
molecule or the antisense sequence of a nucleic acid molecule that encodes a polypeptide 

30 comprising an amino acid sequence of SEQ ID NO: 96-237. In an even more preferred 
embodiment, the invention provides a nucleic acid molecule that selectively hybridizes to 
a nucleic acid molecule comprising the nucleic acid sequence of SEQ ID NO: 1-95 or the 
antisense sequence thereof. Preferably, the nucleic acid molecule selectively hybridizes to 
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a nucleic acid molecule or the antisense sequence of a nucleic acid molecule encoding a 
CSP under low stringency conditions. More preferably, the nucleic acid molecule 
selectively hybridizes to a nucleic acid molecule or the antisense sequence of a nucleic 
acid molecule encoding a CSP under moderate stringency conditions. Most preferably, 
5 the nucleic acid molecule selectively hybridizes to a nucleic acid molecule or the antisense 
sequence of a nucleic acid molecule encoding a CSP under high stringency conditions. In 
a preferred embodiment, the nucleic acid molecule hybridizes under low, moderate or high 
stringency conditions to a nucleic acid molecule or the antisense sequence of a nucleic 
acid molecule encoding a polypeptide comprising an amino acid sequence of SEQ ID NO: 
10 96-237. In a more preferred embodiment, the nucleic acid molecule hybridizes under low, 
moderate or high stringency conditions to a nucleic acid molecule or the antisense 
sequence of a nucleic acid molecule comprising a nucleic acid sequence selected from 
SEQ ID NO: 1-95. 

Nucleic acid molecules of the present invention may also comprise nucleic acid 

15 sequences that exhibit substantial sequence similarity to a nucleic acid encoding a CSP or 
a complement of the encoding nucleic acid molecule. In this embodiment, it is preferred 
that the nucleic acid molecule exhibit substantial sequence similarity to a nucleic acid 
molecule encoding human CSP. More preferred is a nucleic acid molecule exhibiting 
substantial sequence similarit>' to a nucleic acid molecule encoding a polypeptide having 

20 an amino acid sequence of SEQ ID NO: 96-237. By substantial sequence similarity it is 
meant a nucleic acid molecule having at least 60%, more preferably at least 70%, even 
more preferably at least 80% and even more preferably at least 85% sequence identitj' 
with a nucleic acid molecule encoding a CSP, such as a polypeptide having an amino acid 
sequence of SEQ ID NO: 96-237. In a more preferred embodiment, the similar nucleic 

25 acid molecule is one that has at least 90%, more preferably at least 95%, more preferably 
at least 97%, even more preferably at least 98%, and still more preferably at least 99% 
sequence identity with a nucleic acid molecule encoding a CSP. Most preferred in this 
embodiment is a nucleic acid molecule that has at least 99.5%, 99.6%, 99.7%, 99.8% or 
99.9% sequence identity with a nucleic acid molecule encoding a CSP. 

30 The nucleic acid molecules of the present invention are also inclusive of those 

exhibiting substantial sequence similarity to a CSNA or its complement. In this 
embodiment, if is preferred that the nucleic acid'molec'ule exhibit substantial sequence 
similarity to a nucleic acid molecule having a nucleic acid sequence of SEQ ID NO: 1-95. 
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By substantial sequence similarity it is meant a nucleic acid molecule that has at least 
60%, more preferably at least 70%, even more preferably at least 80% and even more 
preferably at least 85% sequence identity with a CSNA, such as one having a nucleic acid 
sequence of SEQ ID NO: 1-95. More preferred is a nucleic acid molecule that has at least 

5 90%, more preferably at least 95%, more preferably at least 97%, even more preferably at 
least 98%, and still more preferably at least 99% sequence identity with a CSNA. Most 
preferred is a nucleic acid molecule that has at least 99.5%, 99.6%, 99.7%, 99.8% or 
99.9% sequence identity with a CSNA. 

Nucleic acid molecules that exhibit substantial sequence similarity are inclusive of 

10 sequences that exhibit sequence identity over their entire length to a CSNA or to a nucleic 
acid molecule encoding a CSP, as well as sequences that are similar over only a part of its 
length. In this case, the part is at least 50 nucleotides of the CSNA or the nucleic acid 
molecule encoding a CSP, preferably at least 100 nucleotides, more preferably at least 150 
or 200 nucleotides, even more preferably at least 250 or 300 nucleotides, still more 

15 preferably at least 400 or 500 nucleotides. 

The substantially similar nucleic acid molecule may be a naturally occurring one 
that is derived from another species, especially one derived from another primate, wherein 
the similar nucleic acid molecule encodes an amino acid sequence that exhibits significant 
sequence identit>' to that of SEQ ID NO: 96-237 or demonstrates significant sequence 

20 identit)' to the nucleotide sequence of SEQ ID NO: 1-95. The similar nucleic acid 

molecule may also be a naturally occurring nucleic acid molecule from a human, when the 
CSNA is a member of a gene family. The similar nucleic acid molecule may also be a 
naturally occurring nucleic acid molecule derived from a non-primate, mammalian 
species, including without limitation, domesticated species, e.g., dog, cat, mouse, rat, 

25 rabbit, hamster, cow, horse and pig; and wild animals, e.g., monkey, fox, lions, tigers, 
bears, giraffes, zebras, etc. The substantially similar nucleic acid molecule may also be a 
naturally occurring nucleic acid molecule derived from a non-mammalian species, such as 
birds or reptiles. The naturally occurring substantially similar nucleic acid molecule may 
be isolated directly from humans or other species. In another embodiment, the 

30 substantially similar nucleic acid molecule may be one that is experimentally produced by 
random mutation of a nucleic acid molecule. In another embodiment, the substantially 
similar nucleic acid molecule may be one that is experimentally produced by directed 
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mutation of a CSNA. In a preferred embodiment, the substantially similar nucleic acid 
molecule is a CSNA. 

The nucleic acid molecules of the present invention are also inclusive of allelic 
variants of a CSNA or a nucleic acid encoding a CSP, For example, single nucleotide 
5 polymorphisms (SNPs) occur frequently in eukaryotic genomes and the sequence 

determined from one individual of a species may differ from other allelic forms present 
within the population. More than 1.4 million SNPs have already been identified in the 
human genome, International Human Genome Sequencing Consortium, Nature 409: 860- 
921 (2001) - Variants with small deletions and insertions of more than a single nucleotide 

10 are also found in the general population, and often do not alter the function of the protein. 
In addition, amino acid substitutions occur frequently among natural allelic variants, and 
often do not substantially change protein function. 

In a preferred embodiment, the allelic variant is a variant of a gene, wherein the 
gene is transcribed into a niRNA that encodes a CSP. In a more preferred embodiment, 

15 the gene is transcribed into a mRNA that encodes a CSP comprising an amino acid 

sequence of SEQ ID NO: 96-237. In another preferred embodiment, the allelic variant is a 
variant of a gene, wherein the gene is transcribed into a mRNA that is a CSNA. In a more 
preferred embodiment, the gene is transcribed into an mRNA that comprises the nucleic 
acid sequence of SEQ ID NO: 1 -95. Also preferred is that the allelic variant be a 

20 naturally occurring allelic variant in the species of interest, particularly human. 

Nucleic acid molecules of the present invention are also inclusive of nucleic acid 
sequences comprising a part of a nucleic acid sequence of the instant invention. The part 
may or may not encode a polypeptide, and may or may not encode a pol3'peptide that is a 
CSP. In a preferred embodiment, the part encodes a CSP. In one embodiment, the 

25 nucleic acid molecule comprises a part of a CSNA. In another embodiment, the nucleic 
acid molecule comprises a part of a nucleic acid molecule that hybridizes or exhibits 
substantial sequence similarity to a CSNA. In another embodiment, the nucleic acid 
molecule comprises a part of a nucleic acid molecule that is an allelic variant of a CSNA. 
In yet another embodiment, the nucleic acid molecule comprises a part of a nucleic acid 

30 molecule that encodes a CSP. A part comprises at least 10 nucleotides, more preferably at 
least 15, 17, 18, 20, 25, 30, 35, 40, 50, 60. 70. 80, 90, 100, 150, 200, 250. 300, 350, 400 or 
500 nucleotides. The maximum size of a nucleic acid part is one nucleotide shorter than 
the sequence of the nucleic acid molecule encoding the full-length protein. 
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Nucleic acid molecules of the present invention are also inclusive of nucleic acid 
sequences that encode fusion proteins, homologous proteins, polypeptide fragments, 
muteins and poljTDeptide analogs, as described infra. 

Nucleic acid molecules of the present invention are also inclusive of nucleic acid 
5 sequences containing modifications of the native nucleic acid molecule. Examples of such 
modifications include, but are not limited to, nonnative intemucleoside bonds, post- 
synthetic modifications or altered nucleotide analogues. One having ordinary skill in the 
art would recognize that the type of modification that may be made will depend upon the 
intended use of the nucleic acid molecule. For instance, when the nucleic acid molecule is 

10 used as a hybridization probe, the range of such modifications will be limited to those that 
pennit sequence-discriminating base pairing of the resulting nucleic acid. When used to 
direct expression of RNA or protein in vitro or in vivo, the range of such modifications 
will be limited to those that permit the nucleic acid to fimction properly as a 
polymerization substrate. When the isolated nucleic acid is used as a therapeutic agent, 

15 the modifications will be limited to those that do not confer toxicity upon the isolated 
nucleic acid. 

Accordingly, in one embodiment, a nucleic acid molecule may include nucleotide 
analogues that incorporate labels that are directly detectable, such as radiolabels or 
fluorophores, or nucleotide analogues that incorporate labels that can be visualized in a 

20 subsequent reaction, such as biotin or various haptens. The labeled nucleic acid molecules 
are particularly useful as hybridization probes. 

Common radiolabeled analogues include those labeled with ^^P, ^"P, and ^^S, such 
as a-^^P-dATP, a--'^P-dCTP, a-^^P-dGTP, a-^'P-dTTP, a-^^P-3'dATP, a-^-P-ATP, a-^^P- 
CTP, a-^^P-GTP, a--^P-UTP, a-^'S-dATP, y-^^S-GTP, y-^^P-dATP, and the like. 

25 Commercially available fluorescent nucleotide analogues readily incorporated into 

the nucleic acids of the present invention include Cy3-dCTP, Cy3-dUTP, Cy5-dCTP, Cy3- 
dUTP (Amersham Biosciences, Piscataway, New Jersey, USA), fluorescein- 12-dUTP, 
tetramethylrhodamine-6-dUTP, Texas Red®-5-dUTP, Cascade Blue®-7-dUTP, 
BODIPY® FL-14-dUTP, BODIPY® TMR-14-dUTP, BODIPY® TR-14-dUTP, 

30 Rhodamine Green™-5-dUTP, Oregon Green® 488-5-dUTP, Texas Red®-12-dUTP, 

BODIPY® 630/650-14-dUTP, BODIPY® 650/665- 14-dUTP, Alexa Fluor® 488-5-dUTP, 
Alexa Fluor® 532-5-dUTP, Alexa Fluor® -568-5-dUTP, Alexa Fluor® 594-5-dUTP, 
Alexa Fluor® 546-14-dUTP, fluorescein- 12-UTP, tetramethylrhodamine-6-UTP, Texas 
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Red®-5-UTP, Cascade Blue®-7-UTP, BODIPY® FL-14-UTP, BODIPY® TMR-M-UTP, 
BODIPY® TR-14-UTP, Rhodamine Green™.5-UTP, Alexa Fluor® 488-5-UTP, Alexa 
Fluor® 546-14-UTP (Molecular Probes, Inc. Eugene, OR, USA). One may also custom 
sjTithesize nucleotides having other fluorophores. See Henegariu et al. Nature 
5 Biotechnol 1 8: 345-348 (2000). 

Haptens that are commonly conjugated to nucleotides for subsequent labeling 
include biotin (biotin-1 1-dUTP, Molecular Probes, Inc., Eugene, OR, USA; 
biotin-21-UTP, biotin-21-dUTP, Clontech Laboratories, Inc., Palo Alto, CA, USA), 
digoxigenin (DIG-1 1-dUTP, alkali labile, DIG-ll-UTP, Roche Diagnostics Corp., 

10 Indianapolis, IN, USA), and dinitrophenyl (dinitrophenyl-l 1-dUTP, Molecular Probes, 
Inc., Eugene, OR, USA). 

Nucleic acid molecules of the present invention can be labeled by incorporation of 
labeled nucleotide analogues into the nucleic acid. Such analogues can be incorporated by 
enz>^matic polymerization, such as by nick translation, random priming, polymerase chain 

15 reaction (PGR), terminal transferase tailing, and end-filling of overhangs, for DNA 

molecules, and /;/ vitro transcription driven, e.g., from phage promoters, such as T7, T3, 
and SP6, for RNA molecules. Commercial kits are readily available for each such 
labeling approach. Analogues can also be incorporated during automated solid phase 
chemical synthesis. Labels can also be incorporated after nucleic acid synthesis, with the 

20 5' phosphate and 3' hydroxy 1 providing convenient sites for post-synthetic covalent 
attachment of detectable labels. 

Other post-synthetic approaches also permit internal labeling of nucleic acids. For 
example, fluorophores can be attached using a cisplatin reagent that reacts with the N7 of 
guanine residues (and, to a lesser extent, adenine bases) in DNA, RNA, and Peptide 

25 Nucleic Acids (PNA) to provide a stable coordination complex between the nucleic acid 
and fluorophore label (LIniversal Linkage System) (available from Molecular Probes, Inc., 
Eugene, OR, USA and Amersham Pharmacia Biotech, Piscataway, NJ, USA); see Alers et 
a/., Genes, Chromosomes & Cancer 25: 301- 305 (1999); Jelsma et al, 1 NIHRes. 5: 82 . 
(1994); Van Belkum et al, BioTechniques 16: 148-153 (1994). Alternatively, nucleic 

30 acids can be labeled using a disulfide-containing linker (FastTag^w Reagent, Vector 

Laboratories, Inc., Burlingame, CA, USA) that is photo- or thermally coupled to the target 
nucleic acid using aryl azide chemistry; after reduction, a free thiol is available for 
coupling to a hapten, fluorophore, sugar, affinity ligand, or other marker. 
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One or more independent or interacting labels can be incorporated into the nucleic 
acid molecules of the present invention. For example, both a fluorophore and a moiety 
that in proximit)^ thereto acts to quench fluorescence can be included to report specific 
hybridization through release of fluorescence quenching or to report exonucleotidic 
5 excision. See, e.g.. Tyagi et al, Nature Biotechnol 14: 303-308 (1996); Tyagi et ah. 
Nature Biotechnol 16: 49-53 (1998); Sokol etal, Proc. Natl Acad. ScL USA 95: 
1 1538-1 1543 (1998); Kostrikis et al, Science 279: 1228-1229 (1998); Marras et al. 
Genet, Anal 14: 151-156 (1999); Holland et al, Proc. Natl Acad. ScL USA 88: 
7276-7280 (1991); Heid etal, Genome Res. 6(10): 986-94 (1996); Kuimelis et al, 

10 Nucleic Acids Symp. Ser. (37): 255-6 (1997); and U.S. Patent Nos. 5,846,726, 5,925,517, 
5,925,517, 5,723,591 and 5,538,848, the disclosures of which are incorporated herein by 
reference in their entireties. 

Nucleic acid molecules of the present invention may also be modified by altering 
one or more native phosphodiester intemucleoside bonds to more nuclease-resistant, 

1 5 intemucleoside bonds, See Hartmann et al (eds.), Manual of Antisense Methodologv: 
Perspectives in Antisense Science . Kluwer Law International (1999); Stein et al (eds.), 
Applied Antisense Oligonucleotide Technology , Wiley-Liss (1998); Chadwick et al 
(eds.), Oligonucleotides as Therapeutic Agents - Symposium No. 209 . John Wiley & Son 
Ltd (1997). Such altered intemucleoside bonds are often desired for techniques or for 

20 targeted gene correction, Camper et al, Nucl Acids Res. 28(21): 4332-4339 (2000). For 
double-stranded RNA inhibition which may utilize either natural ds RNA or ds RNA 
modified in its, sugar, phosphate or base, see Hannon, Nature 41^11): 244-251 (2002); 
Fire et al in WO 99/32619; Tuschl et al in US2002/00S6356; Kruetzer et al in WO 
00/44895, the disclosures of which are incorporated herein by reference in their entirety. 

25 For circular antisense, see Kool in U.S. Patent No. 5,426,180, the disclosure of which is 
incorporated herein by reference in its entirety. 

Modified oligonucleotide backbones include, without limitation, 
phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, 
aminoalkylphosphotriesters, methyl and other alky! phosphonates including 3'-alk>'lene 

30 phosphonates and chiral phosphonates, phosphinates, phosphoramidates including 

3 '-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates, 
thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having 
normal 3'-5' linkages, 2'-5' linked analogs of these, and those having inverted polarity 
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wherein the adjacent pairs of nucleoside units are linked 3'-.5' to 5'-3' or 2'-5' to 5'-2'. 
Representative U.S. Patents that teach the preparation of the above phosphorus-containing 
linkages include, but are not limited to, U.S. Patent Nos. 3,687,808; 4,469,863; 4,476,301; 
5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276.019; 5,278,302; 5,286,717; 5,321,131; 
5 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 
5,541,306; 5,550,111; 5,563,253; 5,571,799; 5,587,361; and 5,625,050. the disclosures of 
which are incorporated herein by reference in their entireties. In a preferred embodiment, 
the modified intemucleoside linkages may be used for antisense techniques. 

Other modified oligonucleotide backbones do not include a phosphorus atom, but 

10 have backbones that are formed by short chain alkyi or cycloalkyl intemucleoside 

linkages, mixed heteroatom and alkj'l or cycloalkyl intemucleoside linkages, or one or 
more short chain heteroatomic or heterocyclic intemucleoside linkages. These include 
those having morpholino linkages (formed in part from the sugar portion of a nucleoside); 
siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl and 

1 5 thioformacetj'l backbones; methylene formacetyl and thioformacetyl backbones; alkene 
containing backbones; sulfamate backbones; methyleneimino and methylenehydrazino 
backbones; sulfonate and sulfonamide backbones; amide backbones; and others having 
mixed N, O, S and CH2 component parts. Representative U.S. patents that teach the 
preparation of the above backbones include, but are not limited to, U.S. Patent Nos. 

20 5,034,506; 5,166,315; 5.185,444; 5,214,134; 5,216.141; 5,235,033; 5,264,562; 5,264,564; 
5,405,938; 5.434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 
5,602,240; 5,610.289; 5,602,240; 5,608,046; 5,610,289; 5.618,704; 5,623,070; 5,663,312; 
5,633,360; 5,677,437 and 5,677,439; the disclosures of which are incorporated herein by 
reference in their entireties. 

25 In other preferred nucleic acid molecules, both the sugar and the intemucleoside 

linkage are replaced with novel groups, such as peptide nucleic acids (PNA). In PNA 
compounds, the phosphodiester backbone of the nucleic acid is replaced with an amide- 
containing backbone, in particular by repeating N-(2-aminoethyl) glycine units linked by 
amide bonds. Nucleobases are bound directly or indirectly to aza nitrogen atoms of the 

30 amide portion of the backbone, typically by methylene carbonyl linkages. PNA can be 
synthesized using a modified peptide synthesis protocol. PNA oligomers can be 
synthesized by both Fmoc and tBoc methods. Representative U.S. patents that teach the 
preparation of PNA compounds include, but are not limited to, U.S. Patent Nos. 
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5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by reference in 
its entirety. Automated PNA s>Tithesis is readily achievable on commercial synthesizers 
{see, e.g., "PNA User^s Guide," Rev. 2, Februar>' 1998, Perseptive Biosystems Part No. 
60138, Applied Biosystems, Inc., Foster City, CA). PNA molecules are advantageous for 
5 a number of reasons. First, because the PNA backbone is uncharged, PN A/DNA and 
PNA/RNA duplexes have a higher thermal stability than is found in DNA/DNA and 
DNA/RNA duplexes. The Tm of a PNA/DNA or PNA/RNA duplex is generally rC 
higher per base pair than the Tm of the corresponding DRATDNA or DRVRNA duplex 
(in 100 mM NaCl). Second, PNA molecules can also form stable PNA/DNA complexes 

10 at low ionic strength, under conditions in which DKVDNA duplex formation does not 
occur. Third, PNA also demonstrates greater specificity in binding to complementary 
DNA because a PNA/DNA mismatch is more destabilizing than DNA/DNA mismatch. A 
single mismatch in mixed a PNA/DNA 15-mer lowers the Tm by S-20°C (15^C on 
average). In the corresponding DNA/DNA duplexes, a single mismatch lowers the Tm by 

15 4-1 6°C (1 l^C on average). Because PNA probes can be significantly shorter than DNA 
probes, their specificity is greater. Fourth, PNA oligomers are resistant to degradation by 
enzj'mes, and the lifetime of these compounds is extended both //; vivo and in vitro 
because nucleases and proteases do not recognize the PNA polyamide backbone with 
nucleobase sidechains. See, e.g., Ray et al, FASEB J. 14(9): 1041-60 (2000); Nielsen et 

20 a/., Pharmacol Toxicol 86(1): 3-7 (2000); Larsen et ah, Biochim Biophys Acta, 1489(1): 
159-66 (1999); Nielsen, Curr. Opin. Struct. Biol 9(3): 353-7 (1999), and Nielsen, Curr, 
Opiih Biotechnol 10(1): 71-5 (1999). 

Nucleic acid molecules may be modified compared to their native structure 
throughout the length of the nucleic acid molecule or can be localized to discrete portions 

25 thereof As an example of the latter, chimeric nucleic acids can be synthesized that have 
discrete DNA and RNA domains and that can be used for targeted gene repair and 
modified PGR reactions, as further described in, Misra et al, Biocliem, 37: 1917-1925 
(1998); and Finn et al, Nucl Acids Res, 24: 3357-3363 (1996), and U.S. Patent Nos. 
5,760,012 and 5,731,181, the disclosures of which are incorporated herein by reference in 

30 their entireties. 

Unless otherwise specified, nucleic acid molecules of the present invention can 
- include any topological conformation appropriate to the desired use; the term thus 
explicitly comprehends, among others, single-stranded, double-stranded, triplexed, 
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quadruplexed, partially double-stranded, partially-triplexed, partially-quadruplexed, 
branched, hairpinned, circular, and padlocked conformations. Padlocked conformations 
and their utilities are further described in Baner et al.. Cum Opitu Biotechiol 12: 11-15 
(2001); Escude et al, Proc, Nad. Acad. Sci. USA 14: 96(19):10603-7 (1999); and Nilsson 
5 et aiy Science 265(5181): 20S5-8 (1994). Triplexed and quadruplexed conformations, and 
their utilities, are reviewed in Praseuth et ai, Biochim. Biophys. Acta. 1489(1): 181-206 

(1999) ; Fox, Curr. Med. Chem. 7(1): 17-37 (2000); Kochetkova et al. Methods Mol Biol 
130: 189-201 (2000); Chan et al^l Mol Med. 75(4): 267-82 (1997); Rowley et ai, Mol 
Med 5{10): 693-700 (1999); Kool, Annu Rev Biophys Biomol Struct, 25: 1-28 (1996). 

1 0 SNP Polymorphisms 

Commonly, sequence differences between individuals involve differences in single 
nucleotide positions. SNPs may account for 90% of human DNA polymorphism. Collins 
et ai, 8 Genome Res. 1229-31 (1998). SNPs include single base pair positions in genomic 
DNA at which different sequence alternatives (alleles) exist in a population. In addition, 

15 the least frequent allele generally must occur at a frequency of 1% or greater. DNA 
sequence variants with a reasonably high population frequency are observed 
approximately every 1,000 nucleotide across the genome, with estimates as high as 1 SNP 
per 350 base pairs. Wang et ai, 280 Science 1077-82 (1998); Harding et ai, 60 Am. J. 
Human Genet 772-89 (1997); Taillon-Miller et aL, 8 Genome Res. 748-54 (1998); Cargill 

20 et a/., 22 Nat, Genet, 231-38 (1999); and Semple et al, 16 Bioinform, Disc, Note 735-38 

(2000) . The frequency of SNPs varies with the type and location of the change. In base 
substitutions, Uvo-thirds of the substitutions involve the C-T and G-A type. This variation 
in frequency can be related to 5-methyIcytosine deamination reactions that occur 
frequently, particularly at CpG dinucleotides. Regarding location, SNPs occur at a much 

25 higher frequency in non-coding regions than in coding regions. Information on over one 
million variable sequences is already publicly available via the Internet and more such 
markers are available from commercial providers of genetic information. Kwok and Gu, 5 
Med. Tbrfay 538-53 (1999). 

Several definitions of SNPs exist. See, e.g., Brooks, 235 Gene 177-86 (1999). As 

30 used herein, the term "single nucleotide polymorphism" or "SNP" includes all single base 
variants, thus including nucleotide insertions and deletions in addition to single nucleotide 
substitutions. There are two types of nucleotide substitutions. A transition is the 
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replacement of one purine by another purine or one pyrimidine by another pyrimidine. A 
transversion is the replacement of a purine for a pyrimidine, or vice versa. 

Numerous methods exist for detecting SNPs within a nucleotide sequence. A 
review of many of these methods can be found in Landegren et al, S Genome Res, 169-16 
5 (1998). For example, a SNP in a genomic sample can be detected by preparing a Reduced 
Complexity Genome (RCG) from the genomic sample, then analyzing the RCG for the 
presence or absence of a SNP. See, e.g., WO 00/18960 which is herein incorporated by 
reference in its entiretj'. Multiple SNPs in a population of target polynucleotides in parallel 
can be detected using, for example, the methods of WO 00/50869 which is herein 

10 incorporated by reference in its entirety. Other SNP detection methods include the 
methods of U.S. Pat. Nos. 6,297,018 and 6,322,980 which are herein incorporated by 
reference in their entirety. Furthermore, SNPs can be detected by restriction fragment 
length polymorphism (RFLP) analysis. See, e.g., U.S. Pat. Nos. 5,324,631; 5,645,995 
which are herein incorporated by reference in their entirety. RFLP analysis of SNPs, 

1 5 however, is limited to cases where the SNP either creates or destroys a restriction enzjone 
cleavage site. SNPs can also be detected by direct sequencing of the nucleotide sequence 
of interest. In addition, numerous assays based on hybridization have also been developed 
to detect SNPs and mismatch distinction by polymerases and ligases. Several web sites 
provide information about SNPs including Ensembl on the World Wide Web at 

20 ensemble.org, Sanger Institute on the World Wide Web at sanger.ac.uk/genetics/exon/, 
National Center for Biotechnology Information (NCBI) on the Worid Wide Web at 
ncbi.nlm.nih.gov/SNP/ , The SNP Consortium Ltd. on the World Wide Web at 
snp.cshl.org. The chromosomal locations for the compositions disclosed herein are 
provided below. In addition, one of ordinary skill in the art could use a BLAST against 

25 the genome or any of the databases cited above to find the chromosomal location. 

Another a preferred method to find the genomic coordinates and associated SNPs would 
be to use the BLAT tool (genome.ucsc.edu, Kent et al. 2001, The Human Genome 
Browser at UCSC, Genome Research 996-1006 or Kent 2002 BLAT —The BLAST -Like 
Alignment Tool Genome Reseach, 1-9 ). All web sites above were accessed December 3, 

30 2003. 

RNA interference 
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RNA interference refers to the process of sequence-specific post transcriptional 
gene silencing in animals mediated by short interfering RNAs (siRNA). Fire et al, 1998, 
Nature^ 391, 806. The corresponding process in plants is commonly referred to as post 
transcriptional gene silencing or RNA silencing and is also referred to as quelling in fungi. 
5 The process of post transcriptional gene silencing is thought to be an evolutionarily 
conser%'ed cellular defense mechanism used to prevent the expression of foreign genes 
which is commonly shared by diverse flora and phyla. Fire et al, 1999, Trends Genet, 
1 5, 358. Such protection from foreign gene expression may have evolved in response to 
the production of double-stranded RNAs (dsRNA) derived from viral infection or the 

10 random integration of transposon elements into a host genome via a cellular response that 
specifically destroys homologous single-stranded RNA or viral genomic RNA. The 
presence of dsRNA in cells triggers the RNAi response though a mechanism that has yet 
to be fully characterized. This mechanism appears to be different from the interferon 
response that results from dsRNA mediated activation of protein kinase PKR and 2',5'- 

15 oligoadenylate synthetase resulting in non-specific cleavage of mRNA by ribonuclease L. 

The presence of long dsRNAs in cells stimulates the activity of a ribonuclease III 
enzyme referred to as dicer. Dicer is involved in the processing of the dsRNA into short 
pieces of dsRNA known as short interfering RNAs (siRNA). Berstein et al, 2001, 
Nature, 409, 363. Short interfering RNAs derived from dicer activity are typically about 

20 21-23 nucleotides in length and comprise about 19 base pair duplexes. Dicer has also been 
implicated in the excision of 21 and 22 nucleotide small temporal RNAs (stRNA) from 
precursor RNA of conserved structure that are implicated in translational control. 
Hutvagner et al, 2001, Science, 293, 834. The RNAi response also features an 
endonuclease complex containing a siRNA, commonly referred to as an RNA-induced 

25 silencing complex (RISC), which mediates cleavage of single-stranded RNA having 
sequence complementary to the antisense strand of the siRNA duplex. Cleavage of the 
target RNA takes place in the middle of the region complementary to the antisense strand 
of the siRNA duplex. Elbashire/fl/., 2001, Genes Dev,, 15, 188. 

Short interfering RNA mediated RNAi has been studied in a variety of systems. 
30 Fire et al, 1998, Nature, 39K 806, were the first to observe RNAi in C. Elegans. Wianny 
and Goetz, 1999, Nature Cell Biol, 2, 70, describe RNAi mediated by dsRNA in mouse 
embryos. Hammond et al, 2000, Nature, 404, 293, describe RNAi in Drosophila cells 
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transfected with dsRNA. Elbashir et al , 200 1 , Nature, 411, 494, describe RNAi induced 
by introduction of duplexes of synthetic 21 -nucleotide RNAs in cultured mammalian cells 
including human embryonic kidney and HeLa cells. Recent work in Drosophila embryonic 
lysates (Elbashir et al, 2001, EMBOl, 20, 6S77) has revealed certain requirements for 
5 siRNA length, structure, chemical composition, and sequence that are essential to mediate 
efficient RNAi activity. These studies have shown that 21 nucleotide siRNA duplexes are 
most active when containing two nucleotide 3*-overhangs. Furthermore, complete 
substitution of one or both siRNA strands with 2'-deoxy (2 -H) or 2'-0-methyl nucleotides 
abolishes RNAi activitj', whereas substitution of the 3*-tenninal siRNA overhang 

10 nucleotides with deoxy nucleotides (2'-H) was shown to be tolerated. Single mismatch 
sequences in the center of the siRNA duplex were also shown to abolish RNAi activitj'. In 
addition, these studies also indicate that the position of the cleavage site in the target RNA 
is defined by the 5'-end of the siRNA guide sequence rather than the 3*-end. Elbashir et 
al, 2001, EMBOJ.y 20, 6S77. Other studies have indicated that a 5'-phosphate on the 

15 target-complementary strand of a siRNA duplex is required for siRNA activity and that 
ATP is utilized to maintain the 5'-phosphate moiety on the siRNA. Nykanen et al., 2001, 
Cell, 107, 309. 

Studies have shown that replacing the 3'-overhanging segments of a 21-mer siRNA 
duplex having 2 nucleotide 3' overhangs with deo.xyribonucleotides does not have an 

20 adverse effect on RNAi activity. Replacing up to 4 nucleotides on each end of the siRNA 
with deoxyribonucleotides has been reported to be well tolerated whereas complete 
substitution with deoxyribonucleotides results in no RNAi activity. Elbashir et al, 2001, 
EMBO J. , 20, 6877. In addition, Elbashir et al., supra, also report that substitution of 
siRNA with 2'-0-methyl nucleotides completely abolishes RNAi activitj'. Li et al., WO 

25 00/44914. and Beach et al., WO 01/68836 both suggest that siRNA "may include 

modifications to either the phosphate-sugar back bone or the nucleoside to include at least 
one of a nitrogen or sulfur heteroatom", however neither application teaches to what extent 
these modifications are tolerated in siRNA molecules nor provides any examples of such 
modified siRNA. Kreutzer and Limmer, Canadian Patent Application No. 2,359,180, also 

30 describe certain chemical modifications for use in dsRNA constructs in order to counteract 
activation of double-stranded RNA-dependent protein kinase PKR, specifically 2'-amino 
or 2*-0-methyl nucleotides, and nucleotides containing a 2*-0 or 4'-C methylene bridge. 
However, Kreutzer and Limmer similarly fail to show to what extent these modifications 
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are tolerated in siRNA molecules nor do they provide any examples of such modified 
siRNA. 

Parrish et al, 2000, Molecular Cell, 6, 1977-1087, tested certain chemical 
modifications targeting the unc-22 gene in C. elegans using long (>25 nt) siRNA 
5 transcripts. The authors describe the introduction of thiophosphate residues into these 
siRNA transcripts by incorporating thiophosphate nucleotide analogs with T7 and T3 
RNA polymerase and obser\'ed that "RNAs with nvo [phosphorothioate] modified bases 
also had substantial decreases in effectiveness as RNAi triggers; [phosphorothioate] 
modification of more than nvo residues greatly destabilized the RNAs in vitro and we 

10 were not able to assay interference activities." Parrish et al. at 1081. The authors also 
tested certain modifications at the 2'-position of the nucleotide sugar in the long siRNA 
transcripts and obser\^ed that substituting deoxynucleoiides for ribonucleotides "produced 
a substantial decrease in interference activit>'", especially in the case of Uridine to 
Thymidine and/or Cytidine to deoxy-Cytidine substitutions. Parrish et al In addition, the 

1 5 authors tested certain base modifications, including substituting 4-thiouracil, 5- 

bromouracil, 5-iodouracil, 3-(aminoallyl)uracil for uracil, and inosine for guanosine in 
sense and antisense strands of the siRNA, and found that whereas 4-thiouracil and 5- 
bromouracil were all well tolerated, inosine "produced a substantial decrease in 
interference activit>'" \\'hen incorporated in either strand. Incorporation of 5-iodouracil and 

20 3-(aminoallyl)uracil in the antisense strand resulted in substantial decrease in RNAi 
activity as well. 

Beach et al., WO 01/68836, describes specific methods for attenuating gene 
expression using endogenously derived dsRNA. Tuschl et al., WO 01/75164, describes a 
Drosophila in vitro RNAi system and the use of specific siRNA molecules for certain 

25 functional genomic and certain therapeutic applications; although Tuschl, 2001 , Chem. 
Biocliem.y 2, 239-245, doubts that RNAi can be used to cure genetic diseases or viral 
infection due "to the danger of activating interferon response". Li et al., WO 00/44914, 
describes the use of specific dsRNAs for use in attenuating the expression of certain target 
genes. Zemicka-Goetz et al., WO 01/36646, describes certain methods for inhibiting the 

30 expression of particular genes in mammalian cells using certain dsRNA molecules. Fire et 
al, WO 99/32619, U.S. Patent No. 6,506,559, the contents of which are hereby 
incorporated by reference in their entirety, describes particular methods for introducing 
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certain dsRNA molecules into cells for use in inhibiting gene expression. Plaetinck et al., 
WO 00/01 S46, describes certain methods for identifying specific genes responsible for 
conferring a particular phenot>T}e in a cell using specific dsRNA molecules, Mello et al, 
WO 01/2905S, describes the identification of specific genes involved in dsRNA mediated 
5 RNAi. Deschamps Depaillette et aL, International PCT Publication No. WO 99/07409, 
describes specific compositions consisting of particular dsRNA molecules combined with 
certain anti-viral agents. Driscoll et al., International PCT Publication No, WO 01/49844, 
describes specific DNA constructs for use in facilitating gene silencing in targeted 
organisms. Parrish et al., 2000, Molecular Cell, 6, 1977-1087, describes specific 
10 chemically modified siRNA constructs targeting the unc-22 gene of C. elegans. Tuschl et 
al., International PCT Publication No. WO 02/44321, describe certain synthetic siRNA 
constructs. 

Methods for Using Nucleic Acid Molecules as Probes and Primers 
The isolated nucleic acid molecules of the present invention can be used as 
1 5 hybridization probes to detect, characterize, and quantify hybridizing nucleic acids in, and 
isolate hybridizing nucleic acids from, both genomic and transcript-derived nucleic acid 
samples. Wlien free in solution, such probes are typically, but not invariably, detectably 
labeled; bound to a substrate, as in a microarray, such probes are typically, but not 
invariably unlabeled. 

20 In one embodiment, the isolated nucleic acid molecules of the present invention 

can be used as probes to detect and characterize gross alterations in the gene of a CSNA, 
such as deletions, insertions, translocations, and duplications of the CSNA genomic locus 
through fluorescence situ hybridization (FISH) to chromosome spreads. See, e.g., 
Andreeff al (eds.), Introduction to Fluorescence In Situ Hybridization: Principles and 

25 Clinical Applications. John Wiley & Sons (1999). The isolated nucleic acid molecules of 
the present invention can be used as probes to assess smaller genomic alterations using, 
eg., Southern blot detection of restriction fragment length polymorphisms. The isolated 
nucleic acid molecules of the present invention can be used as probes to isolate genomic 
clones that include a nucleic acid molecule of the present invention, which thereafter can 

30 be restriction mapped and sequenced to identify deletions, insertions, translocations, and 
substitutions (single nucleotide polymorphisms, SNPs) at the sequence level 
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Alternatively, detection techniques such as molecular beacons may be used, see Kostrikis 
et al Science 279:1228-1229 (1998). 

The isolated nucleic acid molecules of the present invention can also be used as 
probes to detect, characterize, and quantify CSNA in, and isolate CSNA from, transcript- 
5 derived nucleic acid samples. In one embodiment, the isolated nucleic acid molecules of 
the present invention can be used as hybridization probes to detect, characterize by length, 
and quantify mRNA by Northern blot of total or poly-A"^- selected RNA samples. In 
another embodiment, the isolated nucleic acid molecules of the present invention can be 
used as hybridization probes to detect, characterize by location, and quantify niRNA by in 

10 situ hybridization to tissue sections. See, e.g,, Schwarchzacher et al. In Situ 

Hybridization , Springer- Verlag New York (2000). In another preferred embodiment, the 
isolated nucleic acid molecules of the present invention can be used as hybridization 
probes to measure the representation of clones in a cDNA library or to isolate hybridizing 
nucleic acid molecules acids from cDNA libraries, permitting sequence level 

15 characterization of mRNAs that hybridize to CSNAs, including, without limitations, 

identification of deletions, insertions, substitutions, truncations, alternatively spliced forms 
and single nucleotide polymorphisms. In yet another preferred embodiment, the nucleic 
acid molecules of the instant invention may be used in microarrays. 

All of the aforementioned probe techniques are well within the skill in the art, and 

20 are described at greater length in standard texts such as Sambrook (2001), supra\ Ausubel 
(1999), supra\ and Walker et al (eds.), The Nucleic Acids Protocols Handbook , Humana 
Press (2000). 

In another embodiment, a nucleic acid molecule of the invention may be used as a 
probe or primer to identify and/or amplif^' a second nucleic acid molecule that selectively 

25 hybridizes to the nucleic acid molecule of the invention. In this embodiment, it is 
preferred that the probe or primer be derived from a nucleic acid molecule encoding a 
CSP. More preferably, the probe or primer is derived from a nucleic acid molecule 
encoding a poljipeptide having an amino acid sequence of SEQ ID NO: 96-237. Also 
prefened are probes or primers derived from a CSNA. More preferred are probes or 

30 primers derived from a nucleic acid molecule having a nucleotide sequence of SEQ ID 
NO: 1-95. 

In general; a probe or primer is at least 10 nucleotides in length, more preferably at 
least 12, more preferably at least 14 and even more preferably at least 16 or 17 nucleotides 
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in length. In an even more preferred embodiment, the probe or primer is at least IS 
nucleotides in length, even more preferably at least 20 nucleotides and even more 
preferably at least 22 nucleotides in length. Primers and probes may also be longer in 
length. For instance, a probe or primer may be 25 nucleotides in length, or may be 30. 40 
5 or 50 nucleotides in length. Methods of performing nucleic acid hybridization using 
oligonucleotide probes are well known in the art. See, e.g., Sambrook et al., 1989, supra, 
Chapter 11 and pp. 11.31-11.32 and 1 1.40-11.44, which describes radiolabeling of short 
probes, and pp. 1 1.45-1 1.53, which describe hybridization conditions for oligonucleotide 
probes, including specific conditions for probe hybridization (pp. 11.50-11.51). 

1 0 Methods of performing primer-directed amplification are also well known in the 

art. Methods for performing the polymerase chain reaction (PCR) are compiled, inter alia, 
in McPherson, PCR Basics: From Background to Bench . Springer Verlag (2000); Innis et 
^l- (eds.), PCR Applications: Protocols for Functional Genomics . Academic Press (1999); 
Gelfand et al. (eds.), PCR Strategies . Academic Press (1998); Newton et al, PCR, 

1 5 Springer-Verlag New York (1 997); Burke (ed.), PCR: Essential Techniques. John Wiley 
it Son Ltd (1996); \\Tiite (ed.), PCR Cloning Protocols: From Molecular Cloning to 
Genetic Engineering . Vol. 67, Humana Press (1996); and McPherson et al (eds.), PCR 2: 
A Practical Approach . Oxford University Press, Inc. (1995). Methods for perfonning RT- 
PCR are collected, e.g., in Siebert et al (eds.), Gene Cloning and Analysis bv RT-PCR . 

20 Eaton Publishing Company/Bio Techniques Books Division, 1998; and Siebert (ed.), PCR 
Technique:RT-PCR . Eaton Publishing Company/ BioTechniques Books (1995). 

PCR and hybridization methods may be used to identify and/or isolate nucleic acid 
molecules of the present invention including allelic variants, homologous nucleic acid 
molecules and fragments. PCR and hybridization methods may also be used to identify, 

25 amplifi' and/or isolate nucleic acid molecules of the present invention that encode 

homologous proteins, analogs, fusion proteins or muteins of the invention. Nucleic acid 
primers as described herein can be used to prime amplification of nucleic acid molecules 
of the invention, using transcript-derived or genomic DNA as the template. 

These nucleic acid primers can also be used, for example, to prime single base 

30 extension (SEE) for SNP detection {See, e.g., U.S. Pat. No. 6,004,744, the disclosure of 
which is incorporated herein by reference in its entirety). 

Isothermal amplification approaches,, such as rolling circle amplification, are also 
now well-described. See, e.g,, Schweitzer ^/ a/., Ciirr, Opiiu Biotechnol 12(1): 21-7 
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(2001); International Patent publications WO 97/19193 and WO 00/15779, and U.S. 
Patent Nos. 5,854,033 and 5,714,320, the disclosures of which are incorporated herein by 
reference in their entireties. Rolling circle amplification can be combined with other 
techniques to facilitate SNP detection. See, e.g., Lizardi et al, Nature Genet 19(3): 
5 225-32 (1998). 

Nucleic acid molecules of the present invention may be bound to a substrate either 
covalently or noncovalently. The substrate can be porous or solid, planar or non-planar, 
unitary or distributed. The bound nucleic acid molecules may be used as hybridization 
probes, and may be labeled or unlabeled. In a preferred embodiment, the bound nucleic 

1 0 acid molecules are unlabeled. 

In one embodiment, the nucleic acid molecule of the present invention is bound to 
a porous substrate, e.g., a membrane, tjTpically comprising nitrocellulose, nylon, or 
positively charged derivatized nylon. The nucleic acid molecule of the present invention 
can be used to detect a hybridizing nucleic acid molecule that is present within a labeled 

15 nucleic acid sample, e.g., a sample of transcript-derived nucleic acids. In another 

embodiment, the nucleic acid molecule is bound to a solid substrate, including, without 
limitation, glass, amorphous silicon, ctystalline silicon or plastics. Examples of plastics 
include, without limitation, poljonethylaciylic, polyethylene, polypropylene, polyacr>'late, 
polymethylmethacr>'late, poly\dnylchloride, polytetrafluoroethylene, polystyrene, 

20 polycarbonate, polyacetal, polysulfone, celluloseacetate, cellulosenitrate, nitrocellulose, or 
mixtures thereof. The solid substrate may be any shape, including rectangular, disk-like 
and spherical. In a preferred embodiment, the solid substrate is a microscope slide or 
slide-shaped substrate. 

The nucleic acid molecule of the present invention can be attached covalently to a 

25 surface of the support substrate or applied to a derivatized surface in a chaotropic agent 
that facilitates denaturation and adherence by presumed noncovalent interactions, or some 
combination thereof. The nucleic acid molecule of the present invention can be bound to a 
substrate to which a plurality of other nucleic acids are concurrently bound, hj'bridization 
to each of the pluralit>' of bound nucleic acids being separately detectable. At low density, 

30 e.g. on a porous membrane, these substrate-bound collections are typically denominated 
macroarrays; at higher density, typically on a solid support, such as glass, these substrate 
bound collections of plural nucleic acids are colloquially termed microarrays. As used 
herein, the term microarray includes arrays of all densities. It is, therefore, another aspect 
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of the invention to provide microarrays that comprise one or more of the nucleic acid 
molecules of the present invention. 

In yet another embodiment, the invention is directed to single exon probes based 
on the CSNAs disclosed herein. 

5 Expression Vectors, Host Cells and Recombinant Methods of Producing 

Polypeptides 

Another aspect of the present invention provides vectors that comprise one or more 
of the isolated nucleic acid molecules of the present invention, and host cells in which 
such vectors have been introduced. 

10 The vectors can be used, inter alia, for propagating the nucleic acid molecules of 

the present invention in host cells (cloning vectors), for shuttling the nucleic acid 
molecules of the present invention between host cells derived from disparate organisms 
(shuttle vectors), for inserting the nucleic acid molecules of the present invention into host 
cell chromosomes (insertion vectors), for expressing sense or antisense RNA transcripts of 

1 5 the nucleic acid molecules of the present invention in vitro or within a host cell, and for 
expressing polypeptides encoded by the nucleic acid molecules of the present invention, 
alone or as fusion proteins with heterologous polypeptides (expression vectors). Vectors 
are by now well known in the art, and are described, inter alia, in Jones et al (eds.), 
Vectors: Cloning Applications: Essential Techniques (Essential Techniques Series), John 

20 Wiley & Son Ltd. (1998); Jones et al (eds.), Vectors: Expression Svstems: Essential 
Techniques (Essential Techniques Series), John Wiley & Son Ltd. (1998); Gacesa et ai. 
Vectors: Essential Data . John Wiley & Sons Ltd. (1995); Cid-Arregui (eds.), Viral 
Vectors: Basic Science and Gene Therapy. Eaton Publishing Co. (2000); Sambrook 
(2001), sup}'a\ Ausubel (1999), supra. Furthermore, a variety of vectors are available 

25 commercially. Use of existing vectors and modifications thereof are well within the skill 
in the art. Thus, only basic features need be described here. 

Nucleic acid sequences may be expressed by operatively linking them to an 
expression control sequence in an appropriate expression vector and employing that 
expression vector to transform an appropriate unicellular host. Expression control 

30 sequences are sequences that control the transcription, post-transcriptional events and 

translation of nucleic acid sequences. Such operative linking of a nucleic acid sequence of 
this invention to an expression control sequence, of course, includes, if not already part of 
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the nucleic acid sequence, the provision of a translation initiation codon, ATG or GTG, in 
the correct reading frame upstream of the nucleic acid sequence. 

A wide variety of host/expression vector combinations may be employed in 
expressing the nucleic acid sequences of this invention. Useftil expression vectors, for 
5 example, may consist of segments of chromosomal, non-chromosomal and synthetic 
nucleic acid sequences. 

In one embodiment, prokaryotic cells may be used with an appropriate vector. 
Prokaryotic host cells are often used for cloning and expression. In a preferred 
embodiment, prokar>'otic host cells include E, coli, Pseudomouas, Bacillus and 
10 Streptomyces, In a preferred embodiment, bacterial host cells are used to express the 
nucleic acid molecules of the instant invention. Useful expression vectors for bacterial 
hosts include bacterial plasmids, such as those from E. colU Bacillus or Streptomyces, 
including pBluescript, pGEX-2T, pUC vectors, col El, pCRl, pBR322, pMB9 and their 
derivatives, wider host range plasmids, such as RP4, phage DNAs, e.g., the numerous 
15 derivatives of phage lambda, e.g., NM9S9, AGTIO and AGTl 1, and other phages, e.g., 
M 13 and filamentous single-stranded phage DNA. Where £, coli is used as host, 
selectable markers are, analogously, chosen for selectivity in gram negative bacteria: eg., 
tjpical markers confer resistance to antibiotics, such as ampicillin, tetracycline, 
chloramphenicol, kanamycin, streptomycin and zeocin; auxotrophic markers can also be 
20 used. 

In other embodiments, eukar>'olic host cells, such as yeast, insect, mammalian or 
plant cells, may be used. Yeast cells, t>'pically 5. cerevisiae, are useful for eukaryotic 
genetic studies, due to the ease of targeting genetic changes by homologous recombination 
and the ability to easily complement genetic defects using recombinantly expressed 

25 proteins. Yeast cells are useful for identifying interacting protein components, eg. 

through use of a Uvo-hybrid system. In a preferred embodiment, yeast cells are usefiil for 
protein expression. Vectors of the present invention for use in yeast will typically, but not 
invariably, contain an origin of replication suitable for use in yeast and a selectable marker 
that is functional in yeast. Yeast vectors include Yeast Integrating plasmids (eg., \lp5) 

30 and Yeast Replicating plasmids (the YRp and YEp series plasmids). Yeast Centromere 
plasmids (the YCp series plasmids), Yeast Artificial Chromosomes (YACs) which are 
based on yeast linear plasmids, denoted YLp, pGED-2, 2p plasmids and derivatives 
thereof, and improved shuttle vectors such as those described in Gietz et ai. Gene, 74: 
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527-34 (1988) (YIplac, ^^plac and YCplac). Selectable markers in yeast vectors include 
a variety of auxotrophic markers, the most common of which are (in Saccharomyces 
cerevisiae) URA3, HISS, LEU2, TRPl and LYS2, which complement specific 
auxotrophic mutations, such as ura3-52, his3-Dl, leu2-DU trpl-Dl and lys2-201. 
5 Insect cells may be chosen for high efficiency protein expression. Where the host 

cells are from Spodoptera frugiperda, e.g., Sf9 and Sf21 cell lines, and expresSF™ cells 
(Protein Sciences Corp., Meriden, CT, USA), the vector replicative strateg>' is typically 
based upon the baculo virus life cycle. Typically, baculovirus transfer vectors are used to 
replace the wild-type AcMNPV polyhedrin gene with a heterologous gene of interest. 

10 Sequences that flank the polyhedrin gene in the wild-type genome are positioned 5' and 3' 
of the expression cassette on the transfer vectors. Following co-transfection with 
AcMNPV DNA, a homologous recombination event occurs between these sequences 
resulting in a recombinant virus camming the gene of interest and the polyhedrin or plO 
promoter. Selection can be based upon visual screening for lacZ fusion activit)'. 

15 The host cells may also be mammalian cells, which are particularly useful for 

expression of proteins intended as pharmaceutical agents, and for screening of potential 
agonists and antagonists of a protein or a physiological pathway. Mammalian vectors 
intended for autonomous extrachromosomal replication will typically include a viral 
origin, such as the S V40 origin (for replication in cell lines expressing the large T-antigen, 

20 such as COSl and COS? cells), the papillomavirus origin, or the EBV origin for long term 
episomal replication (for use, eg., in 293-EBNA cells, which constitutively express the 
EBV EBNA-1 gene product and adenovirus El A). Vectors intended for integration, and 
thus replication as part of the mammalian chromosome, can, but need not, include an 
origin of replication functional in mammalian cells, such as the SV40 origin. Vectors 

25 based upon viruses, such as adenovirus^ adeno-associated virus, vaccinia virus, and 

various mammalian retroviruses, will t>'pically replicate according to the viral replicative 
strategy. Selectable markers for use in mammalian cells include, but are not limited to, 
resistance to neomycin (G418), blasticidin, hygromycin and zeocin, and selection based 
upon the purine salvage pathway using HAT medium. 

30 Expression in mammalian cells can be achieved using a variety of plasmids, 

including pSV2, pBC12BI, and p91023, as well as lytic virus vectors (e.g., vaccinia virus, 
adeno virus, and baculovirus), episomal virus vectors* (e.g.; bovine papillomavirus), and 
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retroviral vectors (e.g., murine retroviruses). Useful vectors for insect cells include 
baculoviral vectors and pVL 941. 

Plant cells can also be used for expression, with the vector replicon typically 
derived from a plant virus (e.g., cauliflower mosaic virus, CaKfV; tobacco mosaic virus, 
5 TM\0 and selectable markers chosen for suitability' in plants. 

It is known that codon usage of different host cells may be different. For example, 
a plant cell and a human cell may exhibit a difference in codon preference for encoding a 
particular amino acid. As a result, human mRNA may not be efficiently translated in a 
plant, bacteria or insect host cell. Therefore, another embodiment of this invention is 
10 directed to codon optimization. The codons of the nucleic acid molecules of the invention 
may be modified to resemble, as much as possible, genes naturally contained within the 
host cell without altering the amino acid sequence encoded by the nucleic acid molecule. 

Any of a wide variety of expression control sequences may be used in these 
vectors to express the nucleic acid molecules of this invention. Such useful expression 
1 5 control sequences include the expression control sequences associated with structural 
genes of the foregoing expression vectors. Expression conttol sequences that control 
transcription include, e.g., promoters, enhancers and transcription termination sites. 
Expression control sequences in eukaryotic cells that control post-transcriptional events 
include splice donor and acceptor sites and sequences that modify the half-life of the 
20 transcribed KNA, e.g., sequences that direct poly(A) addition or binding sites for RNA- 
binding proteins. Expression control sequences that control translation include ribosome 
binding sites, sequences which direct targeted expression of the polypeptide to or within 
particular cellular compartments, and sequences in the 5' and 3' untranslated regions that 
modify the rate or efficiency of translation. 
25 Examples of useful expression control sequences for a prokaryote, e.g., E, coli, 

will include a promoter, often a phage promoter, such as phage lambda pL promoter, the 
trc promoter, a hybrid derived from the trp and lac promoters, the bacteriophage T7 
promoter (in E. coli cells engineered to express the T7 polymerase), the TAC or TEC 
system, the major operator and promoter regions of phage lambda, the control regions of 
30 fd coat protein, and the araB AD operon. Prokaryotic expression vectors may further 

include transcription temiinators, such as the aspA terminator, and elements that facilitate 
translation, such as a consensus ribosome binding site and translation termination codon, 
Schomer^/fi/., Proc. Natl. Acad. ScL USA S3: 8506-8510 (1986). 
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Expression control sequences for yeast cells, typically S. cerevisiae, will include a 
yeast promoter, such as the CYCl promoter, the GALl promoter, the GALIO promoter, 
ADHl promoter, the promoters of the yeast a-mating system, or the GPD promoter, and 
will typically have elements that facilitate transcription termination, such as the 
5 transcription termination signals from the CYCl or ADHl gene. 

Expression vectors useful for expressing proteins in mammalian cells will include 
a promoter active in mammalian cells. These promoters include, but are not limited to, 
those derived from mammalian viruses, such as the enhancer-promoter sequences from the 
immediate early gene of the human cjlomegalovirus (CMV), the enhancer-promoter 

10 sequences from the Rous sarcoma virus long terminal repeat (RSV LTR), the enhancer- 
promoter from SV40 and the early and late promoters of adenovirus. Other expression 
control sequences include the promoter for 3-phosphoglycerate kinase or other glycolytic 
enz>'mes, the promoters of acid phosphatase. Other expression conn-ol sequences include 
those from the gene comprising the CSNA of interest. Often, expression is enhanced by 

1 5 incoq>oration of polyadenylation sites, such as the late SV40 polyadenylation site and the 
polyadenylation signal and transcription termination sequences from the bovine growth 
hormone (BGH) gene, and ribosome binding sites. Furthermore, vectors can include 
introns, such as intron II of rabbit p-globin gene and the SV40 splice elements. 

Preferred nucleic acid vectors also include a selectable or amplifiable marker gene 

20 and means for amplifying the copy number of the gene of interest. Such marker genes are 
well known in the art. Nucleic acid vectors may also comprise stabilizing sequences {e.g., 
ori- or ARS-like sequences and telomere-like sequences), or may alternatively be designed 
to favor directed or non-directed integration into the host cell genome. In a preferred 
embodiment, nucleic acid sequences of this invention are inserted in frame into an 

25 expression vector that allows a high level expression of an RNA which encodes a protein 
comprising the encoded nucleic acid sequence of interest. Nucleic acid cloning and 
sequencing methods are well known to those of skill in the art and are described in an 
assortment of laboratory manuals, including Sambrook (1989), supra, Sambrook (2000), 
supra] Ausubel (1992), supra; and Ausubel (1999), supra. Product information from 

30 manufacturers of biological, chemical and immunological reagents also provide useful 
information. 

Expression vectors may be either constitutive or inducible. Inducible vectors 
include either naturally inducible promoters, such as the trc promoter, which is regulated 
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by the lac operon, and the pL promoter, which is regulated bj' tryptophan, the 
MMTV-LTR promoter, which is inducible by dexamethasone, or can contain s>'nthetic 
promoters and/or additional elements that confer inducible control on adjacent promoters. 
Examples of inducible s>'nthetic promoters are the hybrid Plac/ara-1 promoter and the 
5 PLtetO-1 promoter. The PLtetO-1 promoter takes advantage of the high expression levels 
from the PL promoter of phage lambda, but replaces the lambda repressor sites with two 
copies of operator 2 of the TnlO tetracycline resistance operon, causing this promoter to 
be tightly repressed by the Tet repressor protein and induced in response to tetracycline 
(Tc) and Tc derivatives such as anhydrotetracycline. Vectors may also be inducible 

10 because they contain hormone response elements, such as the glucocorticoid response 
element (GRE) and the estrogen response element (ERE), which can confer hormone 
inducibility where vectors are used for expression in cells having the respective hormone 
receptors. To reduce background levels of expression, elements responsive to ecdysone, 
an insect hormone, can be used instead, with coexpression of the ecdysone receptor. 

1 5 In one embodiment of the invention, expression vectors can be designed to fuse the 

expressed poK'peptide to small protein tags that facilitate purification and/or visualization. 
Such tags include a polyhistidine tag that facilitates purification of the fusion protein by 
immobilized metal affinity chromatography, for example using NiNTA resin (Qiagen Inc., 
Valencia, CA, USA) or TALON'^^^ resin (cobalt immobilized affinity chromatography 

20 medium, Clontech Labs, Palo Alto, CA, USA). The fusion protein can include a chitin- 
binding tag and self-excising intein, permitting chitin-based purification with self-removal 
of the fused tag (IMPACT™ system, New England Biolabs, Inc., Beverley, MA, USA). 
Alternatively, the fusion protein can include a calmodulin-binding peptide tag, permitting 
purification by calmodulin affinity resin (Stratagene, La Jolla, CA, LISA), or a specifically 

25 excisable fragment of the biotin carboxylase carrier protein, permitting purification of in 
vivo biotinylated protein using an avidin resin and subsequent tag removal (Promega, 
Madison, WI, USA). As another useful alternative, the polypeptides of the present 
invention can be expressed as a fusion to glutathione-S-transferase, the affinity and 
specificity of binding to glutathione permitting purification using glutathione affinity 

30 resins, such as Glutathione-Superflow Resin (Clontech Laboratories, Palo Alto, CA, 

USA), with subsequent elution with free glutathione. Other tags include, for example, the 
Xpress epitope, detectable by anti-Xpress antibody (Invitrjogen, Carlsbad, CA, USA), .a 
myc tag, detectable by anti-myc tag antibody, the V5 epitope, detectable by anti-V5 
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antibody (Invitrogen, Carlsbad CA, USA), FLAG® epitope, detectable by anti-FLAG® 
antibody (Stratagene, La Jolla, CA, USA), and the HA epitope, detectable by anti-HA 
antibody. 

For secretion of expressed polypeptides, vectors can include appropriate sequences 

5 that encode secretion signals, such as leader peptides. For example, the pSecTag2 vectors 
(Invin-ogen, Carlsbad, CA, USA) are 5.2 kb mammalian expression vectors that carry the 
secretion signal from the V-J2-C region of the mouse Ig kappa-chain for efficient secretion 
of recombinant proteins from a variety of mammalian cell lines. 

Expression vectors can also be designed to ftise proteins encoded by the 

10 heterologous nucleic acid insert to pol>T)eptides that are larger than purification and/or 

identification tags. Useful protein fusions include those that permit display of the encoded 
protein on the surface of a phage or cell, fusions to intrinsically fluorescent proteins, such 
as those that have a green fluorescent protein (GFP)-like chromophore, fusions to the IgG 
Fc region, and fusions for use in uvo hybrid systems. 

15 Vectors for phage display fuse the encoded polypeptide to, e.g., the gene III 

protein (pIII) or gene VIII protein (pVIII) for display on the surface of filamentous phage, 
such as Ml 3. See Barbas et al., Phage Displav: A Laboratory Manual Cold Spring 
Harbor Laboratory Press (2001); Kay et al (eds.), Phage Disolav of Peptides and Proteins: 
A Laboratory^ Manual Academic Press, Inc., (1996); Abelson et al (eds.), Combinatorial 

20 Chemistn^ (Methods in Enzymology, Vol. 267) Academic Press (1996). Vectors for yeast 
display, e.g. the pYDl yeast display vector (Invitrogen, Carlsbad, CA, USA), use the 
a-agglutinin yeast adhesion receptor to display recombinant protein on the surface of 5. 
cerevisiae. Vectors for mammalian display, e.g., the pDisplay™ vector (Invitrogen, 
Carlsbad, CA, USA), target recombinant proteins using an N-terminal cell surface 

25 targeting signal and a C-terminal transmembrane anchoring domain of platelet derived 
growth factor receptor. 

A wide variety of vectors now exist that fuse proteins encoded by heterologous 
nucleic acids to the chromophore of the substrate-independent, intrinsically fluoresce^it 
green fluorescent protein from Aequorea victoria ("GFP") and its variants. The GFP-like 

30 chromophore can be selected from GFP-like chromophores found in naturally occurring 
proteins, such as A. victoria GFP (GenBank accession number AAA27721), Renilla 
reniformis GFP, FP583 (GenBank accession no. AF168419) (DsRed), FP593 (AF27271 1), 
FP483 (AF168420), FP4S4 (AF168424), FP595 (AF246709), FP486 (AF168421), FP53S 
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(AF168423), and FP506 (AF168422), and need include only so much of the native protein 
as is needed to retain the chroniophore's intrinsic fluorescence. Methods for determining 
the minimal domain required for fluorescence are known in the art. See Li et aL, 1 BioL 
Chem. 272: 2S545-28549 (1997). Alternatively, the GFP-like chromophore can be 
5 selected from GFP-like chromophores modified from those found in nature. The methods 
for engineering such modified GFP-like chromophores and testing them for fluorescence 
activity, both alone and as part of protein fusions, are well known in the art. See Heim et 
aL, Ciirr. Biol 6: 178-182 (1996) and Palm et al, Methods Enzynnol 302: 378-394 (1999). 
A variet}' of such modified chromophores are now commercially available and can readily 

10 be used in the fusion proteins of the present invention. These include EGFP ("enhanced 
GFP"), EBFP ("enhanced blue fluorescent protein"), BFP2, EYFP ("enhanced yellow 
fluorescent protein''), ECFP ("enhanced cyan fluorescent protein") or Citrine. EGFP {see, 
e,g, Comiack et al. Gene 173: 33-38 (1996); U.S. Patent Nos. 6,090,919 and 5,804,387, 
the disclosures of which are incorporated herein by reference in their entireties) is found 

15 on a variety of vectors, both plasmid and viral, which are available commercially 

(Clontech Labs, Palo Alto, CA, USA); EBFP is optimized for expression in mammalian 
cells whereas BFP2, which retains the original jellyfish codons, can be expressed in 
bacteria (see, e.g,, Heim et aL, Cwr. BioL 6: 178-182 (1996) and Cormack etaL, Gene 
173: 33-38 (1996)). Vectors containing these blue-shifted variants are available from 

20 Clontech Labs (Palo Alto, CA, USA). Vectors containing EYFP, ECFP (see, e.g., Heim et 
al, Cwr. BioL 6: 178-182 (1996); Miyawaki et al. Nature 388: 882-887 (1997)) and 
Citrine (see, e.g., Heikal et aL.Proc, NatL Acad. Sci. USA 97: 11996-12001 (2000)) are 
also available from Clontech Labs. The GFP-like chromophore can also be drawn firom 
other modified GFPs, including those described in U.S. Patent Nos. 6.124,128; 6,096,865; 

25 6,090,919; 6,066,476; 6,054,321; 6,027,881; 5,968,750; 5,874,304; 5,804,387; 5,777,079; 
5,741,668; and 5,625,048, the disclosures of which are incorporated herein by reference in 
their entireties. See also Conn (ed.). Green Fluorescent Protein (Methods in Enzj^nology, 
Vol. 302), Academic Press, Inc. (1999); Yang, et al. J Biol Chem, 273: 8212-6 (1998); 
Bevis et aL, Nature Biotechnology, 20:83-7 (2002). The GFP-like chromophore of each 

30 of these GFP variants can usefully be included in the fusion proteins of the present 
invention. 

Fusions to the IgG Fc region increase seium half-life of protein pharmaceutical 
products through interaction with the FcRn receptor (also denominated the FcRp receptor 
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and the Brambell receptor, FcRb), further described in International Patent Application 
Nos. WO 97/43316, WO 97/34631, WO 96/32478, and WO 96/1S412, the disclosures of 
which are incorporated herein by reference in their entireties. 

For long-term, high-yield recombinant production of the polypeptides of the 

5 present invention, stable expression is preferred. Stable expression is readily achieved by 
integration into the host cell genome of vectors having selectable markers, followed by 
selection of these integrants. Vectors such as pUB6/V5-His A, B, and C (Invitrogen, 
Carlsbad, CA, USA) are designed for high-level stable expression of heterologous proteins 
in a wide range of mammalian tissue types and cell lines. pUB6/V5-His uses the 

10 promoter/enhancer sequence from the human ubiquitin C gene to drive expression of 

recombinant proteins: expression levels in 293, CHO, and NIH3T3 cells are comparable to 
levels from the CMV and human EE- la promoters. The bsd gene permits rapid selection 
of stably transfected mammalian cells with the potent antibiotic blasticidin. 

Replication incompetent retroviral vectors, tiT^ically derived from Moloney murine 

1 5 leukemia virus, also are useful for creating stable transfectants having integrated provirus. 
The highly efficient transduction machinery of retroviruses, coupled with the availability 
of a variety of packaging cell lines such as RetroPack™ PT 67, EcoPack2T'^-293, 
AmphoPack"293, and GP2-293 cell lines (all available from Clontech Laboratories, Palo 
Alto, CA, USA) allow a wide host range to be infected with high efficiency; var>'ing the 

20 multiplicity of infection readily adjusts the copy number of the integrated provirus. 

Of course, not all vectors and expression control sequences will function equally 
well to express the nucleic acid molecules of this invention. Neither will all hosts function 
equally well with the same expression system. Hovi'ever, one of skill in the art may make 
a selection among these vectors, expression control sequences and hosts without undue 

25 experimentation and without departing from the scope of this invention. For example, in 
selecting a vector, the host must be considered because the vector must be replicated in it. 
The vector's copy number, the ability to control that copy number, the abilit>' to control 
integration, if any, and the expression of any other proteins encoded by the vector, such as 
an antibiotic or other selection marker, should also be considered. The present invention 

30 further includes host cells comprising the vectors of the present invention, either present 
episomally within the cell or integrated, in whole or in part, into the host cell chromosome. 
Among other considerations, some of which are described above, a host cell strain may be 
chosen for its ability to process the expressed polypeptide in the desired fashion. Such 
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post-translational modifications of the polypeptide include, but are not limited to, 
acetylation, carboxylation, glycosylation, phosphorjiation, lipidation, and acylation, and it 
is an aspect of the present invention to provide CSPs with such post-translational 
modifications. 

5 In selecting an expression control sequence, a varietj' of factors should also be 

considered. These include, for example, the relative strength of the sequence, its 
controllability, and its compatibility with the nucleic acid molecules of this invention, 
particularly with regard to potential secondary structures. Unicellular hosts should be 
selected by consideration of their compatibilitj' with the chosen vector, the toxicity of the 

10 product coded for by the nucleic acid sequences of this invention, their secretion 

characteristics, their ability to fold the pol>'peptide correctly, their fenmentation or culture 
requirements, and the ease of purification from them of the products coded for by the 
nucleic acid molecules of this invention. 

The recombinant nucleic acid molecules and more particularly, the expression 

1 5 vectors of this invention may be used to express the polypeptides of this invention as 
recombinant polypeptides in a heterologous host cell. The polypeptides of this invention 
may be full-length or less than full-length polypeptide fragments recombinantly expressed 
from the nucleic acid molecules according to this invention. Such polj'peptides include 
analogs, derivatives and muteins that may or may not have biological activity. 

20 Vectors of the present invention will also often include elements that permit in 

vitro transcription of RNA from the inserted heterologous nucleic acid. Such vectors 
typically include a phage promoter, such as that from T7, T3, or SP6, flanking the nucleic 
acid insert. Often two different such promoters flank the inserted nucleic acid, permitting 
separate in vitro production of both sense and antisense strands. 

25 Transformation and other methods of introducing nucleic acids into a host cell 

(e.g., conjugation, protoplast transformation or fusion, transfection, electroporation, 
liposome delivery, membrane fusion techniques, high velocity DNA-coated pellets, viral 
infection and protoplast fusion) can be accomplished by a variet>' of methods which are 
well known in the art (See, for instance, Ausubel, supra, and Sambrook et al, supra). 

30 Bacterial, yeast, plant or mammalian cells are transformed or transfected with an 

expression vector, such as a plasmid, a cosmid, or the like, wherein the expression vector 
comprises the nucleic acid of interest. Alternatively, the cells may be infected by a viral 
expression vector comprising the nucleic acid of interest. Depending upon the host cell, 
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vector, and method of transformation used, transient or stable expression of the 
polypeptide will be constitutive or inducible. One having ordinary' skill in the art will be 
able to decide whether to express a polypeptide transiently or stably, and whether to 
express the protein constitulively or inducibly. 
5 A wide variety of unicellular host cells are useful in expressing the DNA 

sequences of this invention. These hosts may include well known eukaryotic and 
prokaryotic hosts, such as strains of, fungi, yeast, insect cells such as Spodoptera 
frugiperda (SF9), animal cells such as CHO, as well as plant cells in tissue culture. 
Representative examples of appropriate host cells include, but are not limited to, bacterial 

10 cells, such as E. coli, Caulobacter crescetUus, Streptomyces species, and Sahuofiella 

typlimiiaium: yeast cells, such as Saccharomyces cerevisiae, Schizosaccharomyces pombe, 
Pichia pastoris, Pichia methmiolica; insect cell lines, such as those from Spodoptera 
frugiperda, e.g., Sf9 and Sf2\ cell lines, and expresSF^^^ cells (Protein Sciences Corp., 
Meriden, CT, USA), Drosophila S2 cells, and Trichoplusia ni High Five® Cells 

15 {Invitrogen, Carlsbad, CA, USA); and mammalian cells. TNT)ical mammalian cells include 
BHK cells, BSC 1 cells, BSC 40 cells, BMT 10 cells, VERO cells, COSl cells, C0S7 
cells, Chinese hamster ovar>' (CHO) cells, 3T3 cells, NIH 3T3 cells, 293 cells, HEPG2 
cells, HeLa cells, L cells, MDCK cells, HEK293 cells, WI38 cells, murine ES cell lines 
{e,g, from strains 129/SV, C57/BL6, DBA-1, 129/SVJ), K562 cells, Jurkat cells, and 

20 BW5147 cells. Other mammalian cell lines are well known and readily available from 
the American T>T>e Culture Collection (ATCC) (Manassas, VA, USA) and the National 
Institute of General Medical Sciences (NIGMS) Human Genetic Cell Repository at the 
Coriell Cell Repositories (Camden, NJ, USA). Cells or cell lines derived from colon are 
particularly preferred because they may provide a more native post-translational 

25 processing. Particulariy preferred are human colon cells. 

Particular details of the transfection, expression and purification of recombinant 
proteins are well documented and are understood by those of skill in the art. Further 
details on the various technical aspects of each of the steps used in recombinant 
production of foreign genes in bacterial cell expression systems can be found in a number 

30 of texts and laborator>' manuals in the art. See, e.g., Ausubel (1992), supra, Ausubel 
(1999), suproy Sambrook (1989), supra, and Sambrook (2001), supra. 
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Methods for introducing the vectors and nucleic acid molecules of the present 
invention into the host cells are well known in the art; the choice of technique will depend 
primarily upon the specific vector to be introduced and the host cell chosen. 

Nucleic acid molecules and vectors may be introduced into prokaryotes, such as E. 
5 coli, in a number of ways. For instance, phage lambda vectors will typically be packaged 
using a packaging extract (e.g., Gigapack® packaging extract, Straiagene, La Jolla, CA, 
USA), and the packaged virus used to infect £. coli. 

Plasmid vectors will t>'pically be introduced into chemically competent or 
electrocompetent bacterial cells. E. coli cells can be rendered chemically competent by 

10 treatment, e.g., with CaCh, or a solution of Mg*^^ Mn^^ Ca'*, Rb* or K"*", dimethyl 

sulfoxide, dithiothreitol, and hexamine cobalt (III), Hanahan, J. Mol Biol, 166(4):557-S0 
(19S3), and vectors introduced by heat shock. A wide variety of chemically competent 
strains are also available commercially (e.g., Epicurian Coli® XLIO-Gold® 
Ultracompelent Cells (Stratagene, La Jolla, CA, USA); DH5a competent cells (Clontech 

15 Laboratories, Palo Alto, CA, USA); and TOP 10 Chemically Competent E. coli Kit 

(Invin"Ogen, Carlsbad, CA, USA)). Bacterial cells can be rendered electrocompetent to 
take up exogenous DNA by electroporation by various pre-pulse treatments; vectors are 
introduced by electroporation followed by subsequent outgrowth in selected media. An 
extensive series of protocols is provided by BioRad (Richmond, CA, USA), 

20 Vectors can be introduced into yeast cells by spheroplasting, treatment with 

lithium salts, electroporation, or protoplast ftision. Spheroplasts are prepared by the action 
of hydrolytic enzymes such as a snail-gut extract, usually denoted Glusulase or 
Zymolyase, or an enzyme from Arthrobacter luteus to remove portions of the cell wall in 
the presence of osmotic stabilizers, tjTpically 1 M sorbitol. DNA is added to the 

25 spheroplasts, and the mixture is co-precipitated with a solution of polyethylene glycol 
(PEG) and Ca^*. Subsequently, the cells are resuspended in a solution of sorbitol, mixed 
with molten agar and then layered on the surface of a selective plate containing sorbitol. 

For lithium-mediated transformation, yeast cells are treated with lithium acetate to 
pemieabilize the cell wall, DNA is added and the cells are co-precipitated with PEG. The 

30 cells are exposed to a brief heat shock, washed free of PEG and lithium acetate, and 
subsequently spread on plates containing ordinary selective medium. Increased 
frequencies of transformation are obtained by using specially-prepared single-stranded 



wo 2004/050860 



PCT/US2003/040063 



67 

carrier DNA and certain organic solvents. Schiestl et ai, Curr. Genet 16(5-6): 339-46 
(19S9). 

For electroporation, freshly-grown yeast cultures are t>T3ically washed, suspended 
in an osmotic protectant, such as sorbitol, mixed with DNA, and the cell suspension 
5 pulsed in an electroporation device. Subsequently, the cells are spread on the surface of 
plates containing selective media. Btcktr et ai, Methods Enz\>7noL 194: 182-187 (1991), 
The effi ciency of transfonnation by electroporation can be increased over 100-fold by 
using PEG, single-stranded carrier DNA and cells that are in late log-phase of growth. 
Larger constructs, such as YACs, can be introduced by protoplast fusion. 

10 Mammalian and insect cells can be directly infected by packaged viral vectors, or 

transfected by chemical or electrical means. For chemical transfection, DNA can be 
coprecipitated with CaP04 or introduced using liposomal and nonliposomal lipid-based 
agents. Commercial kits are available for CaP04 transfection (CalPhos^^^ Mammalian 
Transfection Kit, Clontech Laboratories, Palo Alto, CA, USA), and lipid-mediated 

15 transfection can be practiced using commercial reagents, such as LIPOFECTAMINE'^^* 
2000, LIPOFECTAMINETM Reagent, CELLFECTIN® Reagent, and LIPOFECTIN® 
Reagent (Invitrogen, Carlsbad, CA, USA), DOTAP Liposomal Transfection Reagent, 
FuGENE 6, X-tremeGENE Q2, DOSPER, (Roche Molecular Biochemicals, Indianapolis, 
IN USA), EffecteneTM, PoIyFeci®, Superfect® (Qiagen, Inc., Valencia, CA, USA). 

20 Protocols for electroporating mammalian cells can be found in, for example, ; Norton et 
al (eds.). Gene Transfer Methods: Introducing DNA into Living Cells and Organisms , 
BioTechniques Books, Eaton Publishing Co. (2000). Other transfection techniques 
include transfection by particle bombardment and microinjection. See, eg., Cheng et al, 
Proc. Natl, Acad ScL USA 90(10): 4455-9 (1993); Yang et al.Proc. Natl Acad, ScL USA 

25 87(24): 9568-72 (1990). 

Production of the recombinantly produced proteins of the present invention can 
optionally be followed by purification. 

Purification of recombinantly expressed proteins is now well within the skill in the 
art and thus need not be detailed here. See, e.g., Thomer et al (eds.), Applications of 

30 Chimeric Genes and Hybrid Proteins. Part A: Gene Expression and Protein Purification 
(Methods in Enzymolog>', Vol. 326), Academic Press (2000); Harbin (ed.), Cloning. Gene 
Expression and Proteia.Purification : Experimental Procedures and Process Rationale. 
Oxford Univ. Press (2001); Marshak et al. Strategies for Protein Purification and 
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Characterization: A Laboratory^ Course Manual Cold Spring Harbor Laboratory Press 
(1996); and Roe (ed.), Protein Purification Applications . Oxford University Press (2001). 

Briefly, however, if purification tags have been fused through use of an expression 
vector that appends such tags, purification can be effected, at least in part, by means 
5 appropriate to the tag, such as use of immobilized metal affinity chromatography for 
polyhistidine tags. Other techniques common in the art include ammonium sulfate 
fractionation, immunoprecipitation, fast protein liquid chromatography (FPLC), high 
performance liquid chromatography (HPLC), and preparative gel electrophoresis. 

Polvpeptides, including Fragments Muteins. Homologous Proteins. Allelic Variants, 

10 Analogs and Derivatives 

Another aspect of the invention relates to polypeptides encoded by the nucleic acid 
molecules described herein. In a preferred embodiment, the polypeptide is a colon 
specific polypeptide (CSP). In an even more preferred embodiment, the polypeptide 
comprises an amino acid sequence of SEQ ID NO:96-237 or is derived from a polypeptide 

1 5 having the amino acid sequence of SEQ ID NO: 96-237. A polypeptide as defined herein 
may be produced recombinantly, as discussed supra, may be isolated from a cell that 
naturally expresses the protein, or may be chemically synthesized following the teachings 
of the specification and using methods well known to those having ordinary skill in the art. 
Polypeptides of the present invention may also comprise a part or fragment of a 

20 CSP. In a preferred embodiment, the fragment is derived from a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO: 96-237. 
Pohpeptides of the present invention comprising a part or fragment of an entire CSP may 
or may not be CSPs. For example, a full-length pol>peptide may be colon-specific, while 
a fragment thereof may be found in other tissues as well as in colon. A polypeptide that is 

25 not a CSP, whether it is a fragment, analog, mutein, homologous protein or derivative, is 
nevertheless useful, especially for immunizing animals to prepare anti-CSP antibodies. In 
a preferred embodiment, the part or fragment is a CSP. Methods of determining whether a 
pohpeptide of the present invention is a CSP are described infra. 

Polypeptides of the present invention comprising fragments of at least 6 

30 contiguous amino acids are also useful in mapping B cell and T cell epitopes of the 
reference protein. See, e.g., Geysen efai, Proc. Natl. Acad ScL USA SI: 3998-4002 
(1984) and U,S. Patent Nos. 4,708,871 and 5,595,915, the disclosures of which are 
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incorporated herein by reference in their entireties. Because the fragment need not itself 
be immunogenic, part of an immunodominant epitope, nor even recognized by native 
antibody, to be useful in such epitope mapping, all fragments of at least 6 amino acids of a 
poljpeptide of the present invention have utility in such a study. 
5 Pol>'peptides of the present invention comprising fragments of at least 8 

contiguous amino acids, often at least 15 contiguous amino acids, are usefiil as 
immunogens for raising antibodies that recognize polypeptides of the present invention. 
See, e.g., Lemer, Nature 299: 592-596 (19S2); Shinnick et al, Annu. Rev. Microbiol 37: 
425-46 (1983); Sutcliffe et al. Science 219: 660-6 (I9S3). As further described in the 

10 above-cited references, virtually all S-mers, conjugated to a carrier, such as a protein, 
prove immunogenic and are capable of eliciting antibody for the conjugated peptide; 
accordingly, all fragments of at least 8 amino acids of the poljTDeptides of the present 
invention have utilit\' as immunogens. 

Pol>Tpeptides comprising fragments of at least S, 9, 10 or 12 contiguous amino 

15 acids are also useful as competitive inhibitors of binding of the entire polypeptide, or a 
portion thereof, to antibodies (as in epitope mapping), and to natural binding partners, 
such as subunits in a multimeric complex or to receptors or ligands of the subject protein; 
this competitive inhibition permits identification and separation of molecules that bind 
specifically to the polypeptide of interest. See U.S. Patent Nos. 5,539,084 and 5,783,674, 

20 incorporated herein by reference in their entireties. 

The polypeptide of the present invention thus preferably is at least 6 amino acids in 
length, typically at least 8, 9, 10 or 12 amino acids in length, and often at least 15 amino 
acids in length. Often, the polypeptide of the present invention is at least 20 amino acids 
in length, even 25 amino acids, 30 amino acids, 35 amino acids, or 50 amino acids or more 

25 in length. Of course, larger polypeptides having at least 75 amino acids, 100 amino acids, 
or even 150 amino acids are also useful, and at times preferred. 

One having ordinar>' skill in the art can produce fragments by truncating the 
nucleic acid molecule, e.g., a CSNA, encoding the polypeptide and then expressing it 
recombinantly. Alternatively, one can produce a fragment by chemically synthesizing a 

30 portion of the full-length polypeptide. One may also produce a fragment by enzymatically 
cleaving either a recombinant polypeptide or an isolated naturally occurring polj'peptide. 
Methods" of producing polypeptide fragments are well known in the art. See, e.g,, 
Sambrook (1989), supra; Sambrook (2001), supra; Ausubel (1992), supra; and Ausubel 
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(1999), supra. In one embodiment, a polypeptide comprising only a fragment, preferably 
a fragment of a CSP, may be produced by chemical or enzymatic cleavage of a CSP 
polypeptide. In a preferred embodiment, a polypeptide fragment is produced by 
expressing a nucleic acid molecule of the present invention encoding a fragment, 
5 preferably of a CSP, in a host cell 

PoKpeptides of the present invention are also inclusive of mutants, fusion proteins, 
homologous proteins and allelic variants. 

A mutant protein, or mutein, may have the same or different properties compared 
to a naturally occurring polypeptide and comprises at least one amino acid insertion, 

10 duplication, deletion, rearrangement or substitution compared to the amino acid sequence 
of a native pol>peptide. Small deletions and insertions can often be found that do not alter 
the function of a protein. Muteins may or may not be colon-specific. Preferably, the 
mutein is colon-specific. More preferably the mutein is a poljTpeptide that comprises at 
least one amino acid insertion, duplication, deletion, rearrangement or substitution 

15 compared to the amino acid sequence of SEQ ID NO: 96-237. Accordingly, in a preferred 
embodiment, the mutein is one that exhibits at least 50% sequence identity, more 
preferably at least 60% sequence identity, even more preferably at least 70?/o, yet more 
preferably at least 80% sequence identity to a CSP comprising an amino acid sequence of 
SEQ ID NO: 96-237. In a yet more preferred embodiment, the mutein exhibits at least 

20 85%, more preferably 90%, even more preferably 95% or 96%, and yet more preferably at 
least 97%, 98%, 99% or 99.5% sequence identity to a CSP comprising an amino acid 
sequence of SEQ ID NO: 96-237. 

A mutein may be produced by isolation from a naturally occurring mutant cell, 
tissue or organism. A mutein may be produced by isolation from a cell, tissue or organism 

25 that has been experimentally mutagenized. Alternatively, a mutein may be produced by 
chemical manipulation of a polypeptide, such as by altering the amino acid residue to 
another amino acid residue using synthetic or semi-synthetic chemical techniques. In a 
preferred embodiment, a mutein is produced from a host cell comprising a mutated nucleic 
acid molecule compared to the naturally occurring nucleic acid molecule. For instance, 

30 one may produce a mutein of a polypeptide by introducing one or more mutations into a 
nucleic acid molecule of the invention and then expressing it recombinantly. These 
mutations may be targeted, in which particular encoded amino acids are altered, or may be 
untargeted, in which random encoded amino acids .within the polypeptide are altered. 
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Muteins with random amino acid alterations can be screened for a particular biological 
activity or property, particularly whether the polypeptide is colon-specific, as described 
below. Multiple random mutations can be introduced into the gene by methods well 
known to the art, e.g., by error-prone PCR, shuffling, oligonucleotide-directed 
5 mutagenesis, assembly PCR, sexual PCR mutagenesis, in vivo mutagenesis, cassette 

mutagenesis, recursive ensemble mutagenesis, exponential ensemble mutagenesis and site- 
specific mutagenesis. Methods of producing muteins with targeted or random amino acid 
alterations are well known in the art. See, e.g., Sambrook (19S9), supra; Sambrook 
(2001), supra; Ausubel (1992), supra] and Ausubel (1999), as well as U.S. Patent No. 

10 5,223,408, which is herein incorporated by reference in its entirety. 

The invention also contemplates pol>peptides that are homologous to a 
polypeptide of the invention. In a preferred embodiment, the pol>TDeptide is homologous 
to a CSP. In an even more preferred embodiment, the polypeptide is homologous to a 
CSP selected from the group having an amino acid sequence of SEQ ID NO: 96-237. By 

15 homologous polypeptide it is meant one that exhibits significant sequence identity' to a 
CSP, preferably a CSP having an amino acid sequence of SEQ ID NO: 96-237. By 
significant sequence identity it is meant that the homologous polypeptide exhibits at least 
50% sequence identity, more preferably at least 60?/4 sequence identity, even more 
preferably at least 70%, yet more preferably at least 80% sequence identitj' to a CSP 

20 comprising an amino acid sequence of SEQ ID NO: 96-237. More preferred are 
homologous polypeptides exhibiting at least 85%, more preferably 90%, even more 
preferably 95% or 96%, and yet more preferably at least 97% or 98% sequence identity to 
a CSP comprising an amino acid sequence of SEQ ID NO: 96-237. Most preferably, the 
homologous polypeptide exhibits at least 99%, more preferably 99.5%, even more 

25 preferably 99.6%, 99.7%. 99.8% or 99.9% sequence identity to a CSP comprising an 

amino acid sequence of SEQ ID NO: 96-237. In a preferred embodiment, the amino acid 
substinitions of the homologous polypeptide are conservative amino acid substitutions as 
discussed supra. 

Homologous polypeptides of the present invention also comprise polypeptide 
30 encoded by a nucleic acid molecule that selectively hybridizes to a CSNA or an antisense 
sequence thereof. In this embodiment, it is preferred that the homologous polypeptide be 
encoded by a nucleic acid molecule that hybridizes to a CSNA under low stringeiicy, 
moderate stringency or high stringency conditions, as defined herein. More preferred is a 
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homologous pol>T)eptide encoded by a nucleic acid sequence which hybridizes to a CSNA 
selected from the group consisting of SEQ ID NO: 1-95 or a homologous polypeptide 
encoded by a nucleic acid molecule that hybridizes to a nucleic acid molecule that encodes 
a CSP, preferably a CSP of SEQ ID NO:96-237 under low stringency, moderate 
5 stringency or high stringency conditions, as defined herein. 

Homologous pol>'peptides of the present invention may be naturally occurring and 
derived from another species, especially one derived from another primate, such as 
chimpanzee, gorilla, rhesus macaque, or baboon, wherein the homologous polypeptide 
comprises an amino acid sequence that exhibits significant sequence identity to that of 

10 SEQ ID NO: 96-237. The homologous polypeptide may also be a naturally occurring 
polypeptide from a human, when the CSP is a member of a family of polypeptides. The 
homologous polypeptide may also be a naturally occurring polypeptide derived from a 
non-primate, mammalian species, including without limitation, domesticated species, e.g., 
dog. cat, mouse, rat, rabbit, guinea pig, hamster, cow, horse, goat or pig. The homologous 

1 5 polypeptide may also be a naturally occurring polypeptide derived from a non-mammalian 
species, such as birds or reptiles. The naturally occurring homologous protein may be 
isolated directly from humans or other species. Alternatively, the nucleic acid molecule 
encoding the naturally occurring homologous polypeptide may be isolated and used to 
express the homologous polypeptide recombinantly. The homologous polypeptide may 

20 also be one that is experimentally produced by random mutation of a nucleic acid 
molecule and subsequent expression of the nucleic acid molecule. Alternatively, the 
homologous polypeptide may be one that is experimentally produced by directed mutation 
of one or more codons to alter the encoded amino acid of a CSP. In a preferred 
embodiment, the homologous polypeptide encodes a polypeptide that is a CSP. 

25 Relatedness of proteins can also be characterized using a second fiinctional test, 

such as the ability' of a first protein competitively to inhibit the binding of a second protein 
to an antibody. It is, therefore, another aspect of the present invention to provide isolated 
polypeptides not only identical in sequence to those described with particularity herein, 
but also to provide isolated polypeptides ("cross-reactive proteins") that competitively 

30 inhibit the binding of antibodies to all or to a portion of the isolated polypeptides of the 
present invention. Such competitive inhibition can readily be determined using 
immunoassays well known in the art. 
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As discussed above, single nucleotide polymorphisms (SNPs) occur frequently in 
eukaryotic genomes, and the sequence determined from one individual of a species may 
differ from other allelic fomis present within the population. Thus, polypeptides of the 
present invention are also inclusive of those encoded by an allelic variant of a nucleic acid 
5 molecule encoding a CSP. In this embodiment, it is preferred that the polj'peptide be 
encoded by an allelic variant of a gene that encodes a polypeptide having the amino acid 
sequence selected from the group consisting of SEQ ID NO: 96-237. More preferred is 
that the polypeptide be encoded by an allelic variant of a gene that has the nucleic acid 
sequence selected from the group consisting of SEQ ID NO: 1-95. 

10 Polypeptides of the present invention are also inclusive of derivative polypeptides 

encoded by a nucleic acid molecule according to the instant invention. In this 
embodiment, it is preferred that the polj^eptide be a CSP. Also preferred are derivative 
polypeptides having an amino acid sequence selected from the group consisting of SEQ 
ID NO: 96-237 and which has been acet>'lated, carboxylated, phosphorylated, 

1 5 glycosylated, ubiquitinated or post-translationally modified in another manner. In another 
preferred embodiment, the derivative has been labeled with, e.g., radioactive isotopes such 
as '^^I, ^^P, ^^S, and ^H. In another preferred embodiment, the derivative has been labeled 
with fluorophores, chemiluminescent agents, enzymes, and antiligands that can serve as 
specific binding pair members for a labeled ligand. 

20 Polj^peptide modifications are well known to those of skill and have been 

described in great detail in the scientific literature. Several particularly common 
modifications, glycosylation, lipid attachment, sulfation, gamma-carboxylation of 
glutamic acid residues, hydroxylation and ADP-ribosylation, for instance, are described in 
most basic texts, such as, for instance Creighton, Protein Structure and Molecular 

25 Properties > 2nd ed., W. H. Freeman and Company (1993). Many detailed reviews are 

available on this subject, such as, for example, those provided by Wold, in Johnson (ed.), 
Posttranslational Covalent Modification of Proteins , pgs. 1-12, Academic Press (1983); 
Seifter et al, Metk Enzymtol. 182: 626-646 (1990) and Rattan ei ah, Amu N.Y. Acad, ScL 
663: 4S-62 (1992). 

30 One may determine whether a polypeptide of the invention is likely to be post- 

translationally modified by analyzing the sequence of the polypeptide to determine if there 
"are peptide' motifs indicative of sites for post-translational modification. There are a 
number of computer programs that permit prediction of post-translational modifications. 
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See, e.g., expasy.org (accessed November 11, 2002) of the world wide web, which 
includes PSORT, for prediction of protein sorting signals and localization sites, Signal?, 
for prediction of signal peptide cleavage sites, MITOPROT and Predotar, for prediction of 
mitochondrial targeting sequences, NetOGlyc, for prediction of type O^glycosylation sites 
5 in mammalian proteins, big-PI Predictor and DGPI, for prediction of prenylation-anchor 
and cleavage sites, and NetPhos, for prediction of Ser, Thr and Tyr phosphorylation sites 
in eukaryotic proteins. Other computer programs, such as those included in GCG, also 
may be used to determine post-translational modification peptide motifs. 

General examples of types of post-translational modifications include, but are not 
10 limited to: (Z)-dehydrobutyrine; 1-chondroitin sulfate-L-aspartic acid ester; I'-glycosyl-L- 
trjTptophan; T-phospho-L-histidine; 1-thioglycine; 2'-(S-L-cysteinyl)-L-histidine; 2-[3- 
carboxamido (trimethylammonio)propyl]-L-histidine; 2'-alpha-mannosyl-L-tiyptophan; 2- 
methyl-L-glutamine; 2-oxobutanoic acid; 2-pyrrolidone carboxylic acid; 3 -(I'-L-histidyl)- 
L-tyrosine; 3'-(Salpha-FAD)-L-histidine; 3'-(S-L-cysteinyl)-L-t}Tosine; 3', 3",5*-triiodo-L- 
1 5 thjTonine; 3'-4'-phospho-L-tyrosine; 3-hydroxy-L-proline; 3'-methyl-L-.histidine; 3- 

methyl-L-lanthionine; 3'-phospho-L-histidine; 4'-(L-trj'ptop]ian)-L-tryptophyl quinone; 42 
N-cysteinyl-glycosylphosphatidylinositolethanolamine; 43 -(T-L-histidyl)-L-tyrosine; 4- 
hydroxy-L-arginine; 4-hydroxy-L-lysine; 4-hydroxy-L-proline; 5*-(N6-Uysine)-L- 
topaquinone; 5-hydroxy-L-lysine; S-methyl-L-arginine; alpha-l-microglobulin-Ig alpha 
20 complex chroniophore; bis-L-cysteinyl bis-L-histidino diiron disulfide; bis-L-cysteinyl-L- 
N3'-histidino-L-serinyI tetrairon* tetrasulfide; chondroitin sulfate D-glucuronyl-D- 
galactosyl-D-galactosyl-D-xylosyl-L-serine; D-alanine; D-allo-isoleucine; D-asparagine; 
dehydroalanine; dehydrotyrosine; dermatan 4-sulfate D-glucuronyl-D-galactosyl-D- 
galactosyl-D-xylosyl-L-serine; D-glucuronyl-N-glycine; dipyrrolylniethanemethyl-L- 
25 cysteine; D-leucine; D-methionine; D-phenylalanine; D-serine; D-tryptophan; glycine 

amide; glycine oxazolecarboxylic acid; glycine thiazolecarboxylic acid; heme P450-bis-L- 
cysteine-L-tjTosine; heme-bis-L-cysteine; hemedioKL-aspartyl ester-L-glutamyl ester; 
hemedioI-L-aspartyl ester-L-glutamyl ester-L-methionine sulfonium; heme-L-cysteine; 
heme-L-histidine; heparan sulfate D-glucuronyl-D-galactosyl-D-galactosyl-D-xylosyl-L- 
30 serine; heme P450-bis-L-cysteine-L-lysine; hexakis-L-cysteinyl hexairon hexasulfide; 
keratan sulfate D-glucuronyl-D-galactosyl-D-galactosyl-D-xylosyl-L-threonine; L 
oxoalanine- lactic acid; L phenyllactic acid; r-(8alpha-FAD>L-histidine; L-2'.4',5'- 
topaquinone; L-3*,4*-dihydroxyphenylalanine; L-3*.4'.5'-trihydroxyphenylalanine; L-4'- 
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bromophenylalanine; L-6'-bromotr}T3tophan; L-alanine amide; L-alanyl imidazolinone 
glycine; L-allysine; L-arginine amide; L-asparagine amide; L-aspartic 4-phosphoric 
anhydride; L-aspartic acid 1 -amide; L-beta-methylthioaspartic acid; L-bromohistidine; L- 
citrulline; L-cysteine amide; L-cysteine glutathione disulfide; L-cysteine methyl disulfide; 
5 L-cysteine methyl ester; L-cysteine oxazolecarboxylic acid; L-cysteine 

oxazolinecarboxylic acid; L-cysleine persulfide; L-cysteine sulfenic acid; L-cysteine 
sulfinic acid; L-cysteine thiazolecarboxylic acid; L-cysteinyl homocitryl molybdenum- 
heptairon-nonasulfide; L-cysteinyl imidazolinone glycine; L-cysteinyl molybdopterin; L- 
cysteinyl molybdopterin guanine dinucleotide; L-cystine; L-erjthro-beta- 

10 hydroxyasparagine; L-erythro-beta-hydroxyaspartic acid; L-gamma-carboxyglutamic acid; 
L-glutamic acid 1 -amide; L-glutamic acid 5-methy] ester; L-glutamine amide; L-glutamyl 
5-glycer)lphosphor}'lethanolamine; L-histidine amide; L-isoglutamyl-polyglutamic acid; 
L-isogluiamyl-polyglycine; L-isoleucine amide; L-lanthionine; L-leucine amide; L-lysine 
amide; L-lysine thiazolecarboxylic acid; L-lysinoalanine; L-methionine amide; L- 

15 methionine sulfone; L-phenyalanine thiazolecarboxylic acid; L-phenylalanine amide; L- 
proline amide; L-selenocysteine; L-selenocysteinyl molybdopterin guanine dinucleotide; 
L-serine amide; L-serine thiazolecarboxylic acid; L-seryl imidazolinone glycine; L-T- 
bromophenylalanine; L-T-bromophenylalanine; L-threonine amide; L-thyroxine; L- 
tryptophan amide; L-trjptophyl quinone; L-tyrosine amide; L-valine amide; meso- 

20 lanthionine; N-(L-glutamyl)-L-t>TOsine: N-(L-isoaspartyl)-glycine; N-{L-isoaspartyl)-L- 
cysteine; N,N,N-trimethyl-L-alanine; N,N-dimethyl-L-proline; N2-acet>'l-L-lysine; N2- 
succinyl-L-tr>ptophan; N4-(ADP-ribosyl)-L-asparagine; N4-glycosyl-L-asparagine; N4- 
hydroxymethyl-L-asparagine; NM-methyl-L-asparagine; N5-methyl-L-glutamine; N6- 1 - 
carboxyethyl-L-lysine; N6-(4-aniino hydroxybut>i)-L-lysine; N6-(L-isoglutamyl)-L- 

25 lysine; N6-(phospho-5'-adenosine)-L-lysine; N6-(phospho-5*-guanosine)-L-tysine; 

N6,N6,N6-trimethyl.L-lysine; N6,N6-dimethyl-L-lysine; N6-acetyl-L-lysine; N6-biotinyl- 
L-Iysine; N6-carboxy-L-lysine; N6-fonnyl-L-lysine; N6-glycyl.L-lysine; N6-lipoyl-L- 
lysine; N6-methyl-L-lysine; N6-methyl-N6-poly(N-methyl-propylamine)-L-lysine; N6- 
mureinyl-L-lysine; N6-myristoyl-L-lysine; N6-palmitoyl-L-ly$ine; N6-pyridoxal 

30 phosphate-L-Iysine; N6-pyruvic acid 2-iminyl-L-lysine; N6-retinal-L-lysine; N- 

acetylglycine; N-acetyl-L-glutamine; N-acetyl-L-alanine; N-acetyl-L-aspartic acid; N- 
acetyl-L-cysteine: N-acetyl-L-glutamic acid; N-acet>'l-L-isoleucine; N-acetyl-L- 
methionine; N-acetj'l-L-proline; N-acetyl-L-serine; N-acetyl-L-threonine; N-acetyl-L- 
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tyrosine; N-acetyl-L- valine; N-alanyl-glycosylphosphatidylinositolethanolamine; N- 
asparaginyl-glycosylphosphatidylinositolethanolamine; N-aspartyl- 
glycosylphosphatidylinositolethanolamine; N-fomiylglycine; N-formyl-L-methionine; N- 
glycyl-glycosylphospharidylinositolethanolamine; N-L-glutamyl-poly-L-glutamic acid; N- 
5 methylglycine; N-methyl-L-alanine; N-methyl-L-methionine; N-methyl-L-phenylalanine; 
N-mjTistoyl-glycine; N-palmitoyM-cysteine; N-pyruvic acid 2-iminyl-L-cysteine; N- 
pyruvic acid 2-iminyl-L-valine; N-seryl-glycosylphosphatidylinosilolethanolan\ine; N- 
seryl-glycosyCSPhingolipidinositolethanolamine; 0-(ADP-ribosyl)-L-serine; 0-(phospho- 
5-adenosine>L-threonine; 0-(phospho-5'-DNA)-L-serine; 0-(phosphoo'-DNA)-L- 

10 tlueonine; 0-(phospho-5'rRNA)'L-serine; 0-(phosphoribosy] dephospho-coenzjTne A)-L- 
serine; 0-(sn-l-glycerophosphotyl)-L-serine; 04'-(8alpha-FAD)-L-tyrosine; 04*-(phospho- 
5'-adenosine)-L-t>'rosine; 04'-(phospho-5*-DNA)-L-tyrosine; 04'-(phosphoo -RNA)-L- 
tjTosine; 04'-(phospho-5'-uridine)-L-tyrosine; 04-glycosyl-L-hydroxyproline; 04'- 
glycosyl-L-tyrosine; 04'-sulfo-L-tyrosine; OS-glycosyl-L-hydroxylysine; 0-glycosyl-L- 

15 serine; O-glycosyl-L-threonine; omega-N-(ADP-ribosyl)-L-argimne; omega-N-omega-N- 
dimethyl-L-arginine; omega-N-methyl-L-arginine; omega-N-omega-N-dimethyl-L- 
arginine; omega-N-phospho-L-arginine; 0*octanoyl-L-serine; O-palmitoyl-L-serine; O- 
palmitoyl-L-threonine; 0-phospho-L-serine; 0-phospho-L-threonine; O- 
phosphopantetheine-L-serine; phycoerythrobilin-bis-L-cysteine; phycourobilin-bis-L- 

20 cysteine; pyrroloquinoline quinone; p>Tuvic acid; S hydroxycinnamyl-L-cysteine; S-(2- 
aminovinyl) methyl-D-eysleine; S-(2-aniinovinyl)-D-cysteine; S-(6-FW-L-cysteine; S- 
(8alpha.FAD)-L-cysteine; S-(ADP-ribosyl)-L-cysteine; S-(L-isoglutamyl)-L-cysteine; S- 
12-hydroxyfamesyl-L-cysteine; S-acetyl-L-cysteine; S-diacylglycerol-L-cysteine; S- 
diph>tanylglycerot diether-L-cysteine; S-famesyl-L-cysteine; S-geranylgeranyl-L- 

25 cysteine; S-glycosyl-L-cysteine; S-glycyl-L-cysteine; S-methyl-L-cysteine; S-nitrosyl-L- 
cysteine; S-palmitoyl-L-cysteine; S-phospho-L-cysteine; S-phycobiliviolin-L-cysteine; S- 
phycocyanobilin-L-cysteine; S-phycoerythrobilin-L-cysteine; S-phytochromobilin-L- 
cysteine; S-selenyl-L-cysteine; S-sulfo-L-cysteine; tetrakis-L-cysteinyl diiron disulfide; 
tetrakis-L-cysteinyl iron; tetrakis-L-cysteinyl tetrairon tetrasulfide; trans-2,3-cis 4- 

30 dihydroxy-L-proIine; tris-L-cysteinyl triiron tetrasulfide; tris-L-cysteinyl triiron trisulfide; 
tris-L-cysteinyl-L-aspartato tetrairon tetrasulfide; tris-L-cysteinyl-L-cysteine persulfido- 
.bis-L-glutamata-L-histidino tetrairon disulfide trioxide; tris-L-cysteinyl-L-N3'-histidino . 
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tetrairon tetrasulfide; tris-L-cysteinyl-L-Nl'-histidino tetrairon tetrasulfide; and tris-L- 
cysteinyl-L-serinyl tetrairon tetrasulfide. 

Additional examples of PTMs may be found in web sites such as the Delta Mass 
database based on Krishna, R. G. and F. Wold (1998). Posttranslational Modifications. 
5 Proteins - Analysis and Design. R. H. Angeletti. San Diego, Academic Press. 1: 121-206; 
Methods in Enz>'molog>', 193, J.A. McClosky (ed) (1990), pages 647-660; Methods in 
Protein Sequence Analysis edited by Kazutomo Imahori and Fumio Sakiyama. Plenum 
Press, (1993) "Post-translational modifications of proteins" R.G. Krishna and F. Wold 
pages 167-172; "GlycoSuiteDB: a new curated relational database of glycoprotein glycan 

10 structures and their biological sources" Cooper et al. Nucleic Acids Res. 29; 332-335 

(2001) "0-GLYCBASE version 4.0: a revised database of 0-gl)'cosylated proteins" Gupta 
et al. Nucleic Acids Research, 27: 370-372 (1999); and "PhosphoBase, a database of 
phosphor>'lation sites: release 2.0.", Kreegipuu et al.Nucleic Acids Res 27(l):237-239 
(1999) see also,WO 02/21 139A2, the disclosure of which is incorporated herein by 

1 5 reference in its entirety. 

Tumorigenesis is often accompanied by alterations in the post-translational 
modifications of proteins. Thus, in another embodiment, the invention provides 
polypeptides from cancerous cells or tissues that have altered post-translational 
modifications compared to the post-translational modifications of polypeptides from 

20 normal cells or tissues. A number of altered post-translational modifications are known. 
One common alteration is a change in phosphorylation state, wherein the polypeptide from 
the cancerous cell or tissue is hyperphosphorylated or hypophosphorylated compared to 
the polypeptide from a normal tissue, or wherein the polypeptide is phosphoiylated on 
different residues than the polypeptide from a normal cell. Another common alteration is 

25 a change in glycosylation state, wherein the polypeptide from the cancerous cell or tissue 
has more or less glycosylation than the polypeptide from a normal tissue, and/or wherein 
the polypeptide from the cancerous cell or tissue has a different type of glycosylation than 
the polypeptide from a noncancerous cell or tissue. Changes in glycosylation may be 
critical because carbohydrate-protein and carbohydrate-carbohydrate interactions are 

30 important in cancer cell progression, dissemination and invasion. See, e.g., Barchi, Ciar. 
Pbarm. Das. 6: 485-501 (2000), Verma, Cancer Biochem. Biophys. 14: 151-162 (1994) 
and Dennis et al., Bioessays 5: 412-421 (1999). 
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Another post-translational modification that may be altered in cancer cells is 
prenylation. Prenylation is the covalent attachment of a hydrophobic^prenyl group (either 
famesyl or geranylgeranyl) to a polypeptide. Prenylation is required for localizing a 
protein to a cell membrane and is often required for poljpeptide ftinction. For instance, 
5 the Ras superfamily of GTPase signalling proteins must be prenylated for function in a 
cell. See, e.g., Prendergast et al., Semin. Cancer Biol 10: 443-452 (2000) and Khwaja et 
zX,, Lancet 355: 741-744 (2000). 

Other post-translation modifications that may be altered in cancer cells include, 
without limitation, pol>'peptide methylation, acetylation, arginylation or racemization of 

10 amino acid residues. In these cases, the polypeptide from the cancerous cell may exhibit 
either increased or decreased amounts of the post-translational modification compared to 
the corresponding polypeptides from noncancerous cells. 

Other pol>Tpeptide alterations in cancer cells include abnormal polypeptide 
cleavage of proteins and aberrant protein-protein interactions. Abnormal polypeptide 

15 cleavage may be cleavage of a polypeptide in a cancerous cell that does not usually occur 
in a normal cell, or a lack of cleavage in a cancerous cell, wherein the polypeptide is 
cleaved in a normal cell. Aberrant protein-protein interactions may be either covalent 
cross-linking or non-covalent binding beUveen proteins that do not normally bind to each 
other. Alternatively, in a cancerous cell, a protein may fail to bind to another protein to 

20 which it is bound in a noncancerous cell. Alterations in cleavage or in protein-protein 
interactions may be due to over- or underproduction of a polypeptide in a cancerous cell 
compared to that in a nomial cell, or may be due to alterations in post-translational 
modifications (see above) of one or more proteins in the cancerous cell. See, e.g., 
Henschen-Edman, Ann. KY, Acad. Sci, 936: 580-593 (2001). 

25 Alterations in polypeptide post-translational modifications, as well as changes in 

polypeptide cleavage and protein-protein interactions, may be determined by any method 
known in the art. For instance, alterations in phosphorylation may be determined by using 
anti-phosphoserine, anti-phosphothreonine or anti-phosphot>Tosine antibodies or by amino 
acid analysis. Glycosylation alterations may be determined using antibodies specific for 

30 different sugar residues, by carbohydrate sequencing, or by alterations in the size of the 
glycoprotein, which can be determined by, e.g., SDS polyacrylamide gel electrophoresis 
(PAGE). Other alterations of post-translational modifications, such as prenylation. 
racemization, methylation, acetylation and arginylation, may be determined by chemical 
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analysis, protein sequencing, amino acid analysis, or by using antibodies specific for the 
particular post-translational modifications. Changes in protein-protein interactions and in 
pol>'peptide cleavage may be analyzed by any method known in the art including, without 
limitation, non-denaturing PAGE (for non-covalent protein-protein interactions), SDS 
5 PAGE (for covalent protein-protein interactions and protein cleavage), chemical cleavage, 
protein sequencing or immunoassays. 

In another embodiment, the invention provides pol>i)eptides that have been post- 
translationally modified. In one embodiment polypeptides may be modified 
enzymatically or chemically, by addition or removal of a post-translational modification, 

10 For example, a polypeptide may be glycosylated or deglycosylated enz>Tnatical]y. 

Similarly, polj^eptides may be phosphorj'lated using a purified kinase, such as a MAP 
kinase (e.g. p3S, ERK, or JNK) or a tjTosine kinase (e.g., Src or erbB2). A polypeptide 
may also be modified through synthetic chemistry. Alternatively, one may isolate the 
pol>'peptide of interest from a cell or tissue that expresses the polypeptide with the desired 

15 post-translational modification. In another embodiment, a nucleic acid molecule encoding 
the polypeptide of interest is introduced into a host cell that is capable of post- 
translationally modifying the encoded polypeptide in the desired fashion. If the 
pol>peptide does not contain a motif for a desired post-translational modification, one may . 
alter the post-translational modification by mutating the nucleic acid sequence of a nucleic 

20 acid molecule encoding the poljpeptide so that it contains a site for the desired post- 
translational modification. Amino acid sequences that may be post-translationally 
modified are known in the art. See, e.g., the programs described above on the website 
expasy.org of the world wide web. The nucleic acid molecule may also be introduced into 
a host cell that is capable of post-translationally modifying the encoded polypeptide. 

25 Similarly, one may delete sites that are post-translationally modified by either mutating 
the nucleic acid sequence so that the encoded polypeptide does not contain the post- 
translational modification motif, or by introducing the native nucleic acid molecule into a 
host cell that is not capable of post-translationally modifying the encoded polypeptide. 
It will be appreciated, as is well known and as noted above, that polypeptides are 

30 not always entirely linear. For instance, poljpeptides may be branched as a result of 

ubiquitination, and they may be circular, with or without branching, generally as a result 
of posttransbtion events, including natural processing events-and events brought about by 
human manipulation which do not occur naturally. Circular, branched and branched 
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circular polypeptides may be synthesized by non-translation natural processes and by 
entirely synthetic methods, as well. Modifications can occur an^^here in a pol>'peptide, 
including the peptide backbone, the amino acid side-chains and the amino or carboxyl 
termini. In fact, blockage of the amino or carboxyl group in a polypeptide, or both, by a 
5 covalent modification, is common in naturally occurring and synthetic polypeptides and 
such modifications may be present in pohT^eptides of the present invention, as well. For 
instance, the amino terminal residue of polypeptides made in £. coli, prior to proteolytic 
processing, almost invariably will be N-formylmethionine, 

Useful post-synthetic (and post-translational) modifications include conjugation to 

10 detectable labels, such as fluorophores. A wide variety of amine-reactive and thiol- 
reactive fluorophore derivatives have been synthesized that react under nondenaturing 
conditions with N-terminal amino groups and epsilon amino groups of lysine residues, on 
the one hand, and with free thiol groups of cysteine residues, on the other. 

Kits are available commexcially that permit conjugation of proteins to a variety of 

1 5 amine-reactive or thiol-reactive fluorophores: Molecular Probes, Inc. (Eugene, OR, USA), 
e.g., offers kits for conjugating proteins to Alexa Fluor 350, Alexa Fluor 430, 
Fluorescein-EX, Alexa Fluor 488, Oregon Green 488, Alexa Fluor 532, Alexa Fluor 546, 
Alexa Fluor 546, Alexa Fluor 568, Alexa Fluor 594, and Texas Red-X. 

A wide variety of other amine-reactive and thiol-reactive fluorophores are 

20 available commercially (Molecular Probes, Inc., Eugene, OR, USA), including Alexa 

?\uoT® 350, Alexa Fluor® 488, Alexa Fluor® 532, Alexa Fluor® 546, Alexa Fluor® 568, 
Alexa Fluor*® 594, Alexa Fluor® 647 (monoclonal antibody labeling kits available from 
Molecular Probes, Inc., Eugene, OR, USA), BODIPY dyes, such as BODIPY 493/503, 
BODIPY FL, BODIPY R6G, BODIPY 530/550, BODIPY TMR, BODIPY 558/568, 

25 BODIPY 558/568, BODIPY 564/570, BODIPY 576/589, BODIPY 581/591 . BODIPY 
TR, BODIPY 630/650, BODIPY 650/665, Cascade Blue, Cascade Yellow, Dansyl, 
lissamine rhodaniine B, Marina Blue, Oregon Green 488, Oregon Green 514, Pacific Blue, 
rhodamine 6G, rhodamine green, rhodamine red, tetramethylrhodamine, Texas Red 
(available from Molecular Probes, Inc., Eugene, OR, USA), 

30 The polypeptides of the present invention can also be conjugated to fluorophores, 

other proteins, and other macromolecules, using biftinctional linking reagents. Common 
. homobifunctional reagents include, e.g,, APG, AEDP, BASED, BMB, BMDB, BMH, 
BMOE, BM[PE0]3, BM[PE0]4, BS3. BSOCOES. DFDNB, DMA, DMP, DMS, DPDPB, 
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DSG, DSP (Lomant's Reagent), DSS, DST, DTBP, DIME, DTSSP, EGS, HBVS, 
Sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS (all available from Pierce, Rockford, IL, USA); 
common heterobifunctional cross-linkers include ABH, AMAS, ANB-NOS, APDP, 
ASBA, BMPA, BMPH, BMPS, EDC, EMCA, EMCH, EMCS, KMUA, KMUH, GMBS, 
5 LC-SMCC, LC-SPDP, MBS, M2C2H, MPBH, MSA, NHS-ASA, PDPH, PMPI, SADP, 
SAED, SAND, SANPAH, SASD, SATP, SBAP, SFAD, SIA, SIAB, SMCC, SMPB, 
SMPH, SMPT, SPDP, Sulfo-EMCS, Sulfo-GMBS, Sulfo-HSAB. Sulfo-KMUS, 
Sulfo-LC-SPDP, Sulfo-MBS, Sulfo-NHS-LC-ASA, Sulfo-S.ADP, Sulfo-SANPAH, 
Sulfo-SIAB, Sulfo-SMCC, Sulfo-SMPB, Sulfo-LC-SMPT. SVSB, TFCS (all available 

1 0 Pierce, Rockford, IL, USA). 

PoKpeplides of the present invention, including full length poh^peptides, 
fragments and fusion proteins, can be conjugated, using such cross-linking reagents, to 
fluorophores that are not amine- or thiol-reactive. Other labels that usefully can be 
conjugated to polypeptides of the present invention include radioactive labels, 

1 5 echosonographic contrast reagents, and MRI contrast agents. 

Polypeptides of the present invention, including full length polypeptides, 
fragments and fusion proteins, can also usefully be conjugated using cross-linking agents 
to carrier proteins, such as KLH, bovine thyroglobulin, and even bovine serum albumin 
(BSA), to increase immunogenicity for raising anti-CSP antibodies. 

20 Polypeptides of the present invention, including fiill length polypeptides, 

fragments and fusion proteins, can also usefully be conjugated to polyethylene glycol 
(PEG); PEGylation increases the serum half life of proteins administered intravenously for 
replacement therapy. Delgado et al, CriU Rev. Ther. Drug Carrier Syst. 9(3-4): 249-304 
(1992); Scott et al, Curr Pharnu Des. 4(6): 423-38 (1998); DeSantis et al, Cun\ Opin 

25 Biotechnol 10(4): 324-30 (1999). PEG monomers can be attached to the protein directly 
or through a linker, with PEGylation using PEG monomers activated with tresyl chloride 
(2,2,2-trifluoroethanesulphonyl chloride) permitting direct attachment under mild 
conditions. 

Polypeptides of the present invention are also inclusive of analogs of a polypeptide 
30 encoded by a nucleic acid molecule according to the instant invention. In a preferred 

embodiment, this polypeptide is a CSP. In a more preferred embodiment, this polypeptide 
is derived from a polypeptide having part or all of the amino acid sequence of SEQ ID 
NO: 96-237. Also preferred is an analog polypeptide comprising one or more 
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substitutions of non-natural amino acids or non-native inter-residue bonds compared to the 
naturally occurring polj'peptide. In one embodiment, the analog is structurally similar to a 
CSP, but one or more peptide linkages is replaced by a linkage selected from the group 
consisting of .•CH2NH-, -CHiS-, -CH2-CH2-, -CH-CH-(cis and trans), -COCH2-, 
5 -CH(0H)CH2- and -CH2SO-. In another embodiment, the analog comprises 

substitution of one or more amino acids of a CSP with a D-amino acid of the same type or 
other non-natural amino acid in order to generate more stable peptides. D-amino acids can 
readily be incorporated during chemical peptide synthesis: peptides assembled from 
D-amino acids are more resistant to proteolytic attack; incorporation of D-amino acids can 

10 also be used to confer specific three-dimensional conformations on the peptide. Other 
amino acid analogues commonly added during chemical s>Tithesis include ornithine, 
norieucine, phosphorjiated amino acids (tj^pically phosphoserine, phosphothreonine, 
phosphot}TOsine), L-malonyltyrosine, a non-hydrolyzable analog of phosphotyrosine (see, 
e.g., Kole et al, Biochem. Biophys, Res. Com. 209: 817-821 (1995)), and various 

15 halogenated phenylalanine derivatives. 

Non-natural amino acids can be incorporated during solid phase chemical synthesis 
or by recombinant techniques, although the fomer is tj^ically more common. Solid 
phase chemical synthesis of peptides is well established in the art. Procedures are 
described, inter alia, in Chan et al. (eds.), Fmoc Solid Phase Peptide Synthesis: A 

20 Practical Approach (Practical Approach Series), Oxford Univ. Press (March 2000); Jones, 
Amino Acid and Peptide Svnthesis (Oxford Chemistrj' Primers, No 7), Oxford Univ. Press 
(1992); and Bodanszky, Principles of Peptide Svnthesis (Springer Laboratory), Springer 
Verlag(1993). 

Amino acid analogues having detectable labels are also usefully incorporated 
25 during synthesis to provide derivatives and analogs. Biotin, for example can be added 
using biotinoyl-(9-fluorenylmethoxycarbonyl)-L-lysine (FMOC biocytin) (Molecular 
Probes, Eugene, OR, USA). Biotin can also be added enzymatically by incorporation into 
a fusion protein of an E. coli BirA substrate peptide. The FMOC and /HOC derivatives of 
dabcyl-L-lysine (Molecular Probes, Inc., Eugene, OR, USA) can be used to incorporate 
30 the dabcyl chromophore at selected sites in the peptide sequence during synthesis. The 
aminonaphthalene derivative EDANS, the most common fluorophore for pairing with the 
dabcyl quencher in fluorescence resonance energy transfer (FRET) systems, can be 
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introduced during automated s>Tithesis of peptides by using EDANS-FMOC-L-glutamic 
acid or the corresponding /BOC derivative (both from Molecular Probes, Inc., Eugene, 
OR, USA). Tetramethylrhodamine fluorophores can be incorporated during automated 
FMOC sjTithesis of peptides using (FMOC)-TMR-L-lysine (Molecular Probes, Inc. 
5 Eugene, OR, USA). 

Other useful amino acid analogues that can be incorporated during chemical 
synthesis include aspartic acid, glutamic acid, lysine, and tyrosine analogues having allyl 
side-chain protection (Applied Biosystems, Inc., Foster City, CA, USA); the allyl side 
chain permits synthesis of cyclic, branched-chain, sulfonated, glycosylated, and 

10 phosphor}'lated peptides. 

A large number of other FMOC-protected non-natural amino acid analogues 
capable of incoiporation during chemical synthesis are available commercially, including, 
€,g.y Fmoc-2-aminobicyclo[2.2.1]heptane-2-carboxylic acid, Fmoc-3-endo- 
aminobicyclo[2.2.1 ]heptane-2-endo-carboxylic acid, Fmoc-3-exo- 

15 aminobicyclo[2.2.1]heptane-2-exo-carboxylic acid, Fmoc-3-endo-amino- 

bicyclo[2.2.1]hept-5-ene-2-endo-carboxylic acid, Fmoc-3-exo-amino-bicyc]o[2.2.1]hept- . 
5-ene-2-exo-carboxylic acid, Fmoc-cis-2-amino-l-cyclohexanecarboxylic acid, Fmoc- 
trans-2-amino-l -cyclohexanecarboxylic acid, Fmoc-1 -amino- l-cyclopentanecarboxylic 
acid, Fmoc-cis-2-amino- l-cyclopentanecarboxylic acid, Fmoc-1 -amino-!- 

20 cyclopropanecarboxylic acid, Fmoc-D-2-amino-4-(ethylthio)but>'ric acid, Fmoc-L-2- 
amino-4-(ethylthio)butyric acid, Fmoc-L-buthionine, Fmoc-S-methyl-L-Cysteine, Fmoc- 
2-aminoben2oic acid (antliranillic acid), Fmoc •3-aminobenzoic acid, Fmoc-4- 
aminobenzoic acid, Fmoc-2-aminobenzophenone-2'-carboxylic acid, Fmoc-N-(4- 
aminobenzoyl)-P-alanine, Fnioc-2-amino-4,5-dimethoxybenzoic acid, Fmoc-4- 

25 aminohippuric acid, Fmoc-2-amino-3-hydroxybenzoic acid, Fmoc-2-amino-5- 
hydroxybenzoic acid, Fmoc-3-amino-4-hydroxybenzoic acid, Fmoc-4-amino-3- 
hydroxybenzoic acid, Fmoc-4-amino-2-hydroxybenzoic acid, Fmoc-5-amino-2- 
hydroxybenzoic acid, Fmoc-2-amino-3-methoxybenzoic acid, Fmoc-4-amino-3- 
niethoxybenzoic acid, Fmoc-2-amino-3-methylbenzoic acid, Fmoc-2-amino-5- 

30 niethylbenzoic acid, Fmoc-2-amino-6-methylben2oic acid, Fmoc-3-amino-2- 
methylbenzoic acid, Fmoc-3-amino-4-methylbenzoic acid, Fmoc-4-amino-3- 
methylbenzoic acid, Fmoc-3-amino-2-naphtoic acid, Fnioc-D,L-3-amino-3- 
phenylpropionic acid, Fmoc-L-Methyldopa, Fmoc-2-amino-4,6-dimethyl-3- 
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p\Tidinecarboxylic acid, Fmoc-D,L-amino-2-thiophenacetic acid, Fmoc-4- 
(carboxymethyl)piperazine, Fmoc-4-carbox>'pipera2ine, Finoc-4- 
(carboxymethyl)homopipera2ine, Fmoc-4-phenyl-4-piperidinecarboxylic acid, Fmoc-L- 
l,2,3,4-tetrahydronorharman-3-carboxylic acid, Fmoc-L-thiazolidine-4-carboxylic acid, all 
5 available from The Peptide Laboratory (Richmond, CA, USA). 

Non-natural residues can also be added biosynthetically by engineering a 
suppressor tRNA, t>T)icaIly one that recognizes the UAG stop codon, by chemical 
aminoacylation with the desired unnatural amino acid. Conventional site-directed 
mutagenesis is used to introduce the chosen stop codon UAG at the site of interest in the 
10 protein gene. When the acylated suppressor tRNA and the mutant gene are combined in 
an in vitro transcription/translation system, the unnatural amino acid is incorporated in 
response to the UAG codon to give a protein containing that amino acid at the specified 
position. Liu ei al, Proc Natl Acad, ScL USA 96(9): 4780-5 (1999); Wang et al. Science 
292(5516): 498-500 (2001). 

1 5 Fusion Proteins 

Another aspect of the present invention relates to the fusion of a polypeptide of the 
present invention to heterologous polypeptides. In a preferred embodiment, the 
polypeptide of the present invention is a CSP. In a more preferred embodiment, the 
polypeptide of the present invention that is fused to a heterologous polypeptide which 

20 comprises part or all of the amino acid sequence of SEQ ID NO: 96-237, or is a mutein, 
homologous polypeptide, analog or derivative thereof In an even more preferred 
embodiment, the fusion protein is encoded by a nucleic acid molecule comprising all or 
part of the nucleic acid sequence of SEQ ID NO: 1-95, or comprises all or part of a nucleic 
acid sequence that selectively hybridizes or is homologous to a nucleic acid molecule 

25 comprising a nucleic acid sequence of SEQ ID NO: 1-95. 

The fusion proteins of the present invention will include at least one fragment of a 
polypeptide of the present invention, which fragment is at least 6, typically at least 8, often 
at least 15, and usefully at least 16, 17, IS, 19, or 20 amino acids long. The fragment of 
the polypeptide of the present to be included in the fusion can usefully be at least 25 

30 amino acids long, at least 50 amino acids long, and can be at least 75, 100, or even 150 
amino acids long. Fusions that include the entirety of a polypeptide of the present 
invention have particular utility. 
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The heterologous polypeptide included within the fusion protein of the present 
invention is at least 6 amino acids in length, often at least S amino acids in length, and 
preferably at least 15. 20, or 25 amino acids in length. Fusions that include larger 
polypeptides, such as the IgG Fc region, and even entire proteins (such as GFP 
5 chromophore-containing proteins) are particularly useful. 

As described above in the description of vectors and expression vectors of the 
present invention, which discussion is incorporated here by reference in its entirety, 
heterologous polj'peptides to be included in the fusion proteins of the present invention 
can usefully include those designed to facilitate purification and/or visualization of 

10 recombinantly-expressed proteins. See, e.g., Ausubel, Chapter 16, (1992), supra. 
Although purification tags can also be incorporated into fusions that are chemically 
sjTithesized, chemical synthesis tj'pically provides sufficient purity that further 
purification by HPLC suffices; however, visualization tags as above described retain their 
utilit>' even when the protein is produced by chemical synthesis, and when so included 

1 5 render the fusion proteins of the present invention useful as directly detectable markers of 
the presence of a poljpeptide of the invention. 

As also discussed above, heterologous polypeptides to be included in the fusion 
proteins of the present invention can usefully include those that facilitate secretion of 
recombinantly expressed proteins into the periplasmic space or extracellular milieu for 

20 prokaryotic hosts or into the culture medium for eukarj'otic cells through incorporation of 
secretion signals and/or leader sequences. For example, a His^ tagged protein can be 
purified on a Ni affinitj' column and a GST fusion protein can be purified on a glutathione 
affinit>' column. Similarly, a fusion protein comprising the Fc domain of IgG can be 
purified on a Protein A or Protein G column and a fusion protein comprising an epitope 

25 tag such as myc can be purified using an immunoaffinity column containing an anti-c-myc 
antibody. It is preferable that the epitope tag be separated from the protein encoded by the 
essential gene by an enzymatic cleavage site that can be cleaved after purification. See 
also the discussion of nucleic acid molecules encoding fusion proteins that may be 
expressed on the surface of a cell 

30 Other useful fusion proteins of the present invention include those that permit use 

of the polypeptide of the present invention as bait in a yeast two-hybrid system. See 
-Bartei et al (eds;), The Yeast Two-Hvbrid Svstem . Oxford University Press (1997); Zhu 
et al. Yeast Hybrid Technologies . Eaton Publishing (2000); Fields et al. Trends Genet, 
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10(S): 286-92 (1994); Mendelsohn a/. , Curr. Opin. Biotechnol 5(5): 482-6 (1994); 

Luban et al, Curr Opin, Biotechnol 6(1): 59-64 (1995); Allen ei al. Trends Biochem. 

Sci. 20(12): 511-6 (1995); Drees, Curr, Opin, Chem. Biol 3(1): 64-70 (1999); Topcu et 

al, Pharm, Res, 17(9): 1049-55 (2000); Fashena et al, Gene 250(1-2): M4 (2000); Colas 
5 et al. Nature 380, 548-550 (1996); Norman, et al. Science 285, 591-595 (1999): 

Fabbrizio et al, Oncogene 18, 4357-4363 (1999); Xu et al, Proc Natl Acad Sci USA. 

94, 12473-12478 (1997); Yang, et al, Nuc, Acids Res, 23, 1 152-1 156 (1995); Kolonin et 

al, Proc Natl Acad Sci USA 95, 14266-14271 (1998); Cohen etal, Proc Nail Acad Sci U 

SA 95, 14272-14277 (1998); Uetz,eM/. Ato/re 403, 623-627(2000); lionet al, Proc Natl 
1 0 Acad Sci USA9E, 4569-4574 (2001). Typically, such fusion is to either £. coli LexA or 

yeast GAL4 DNA binding domains. Related bait plasmids are available that express the 

bait fused to a nuclear localization signal. 

Other useful fusion proteins include those that permit display of the encoded 

polypeptide on the surface of a phage or cell, fusions to intrinsically fluorescent proteins, 
1 5 such as green fluorescent protein (GPP), and fusions to the IgG Fc region, as described 

above. 

The polypeptides of the present invention can also usefully be fused to protein 
toxins, such as Pseudomonas exotoxin A, diphtheria toxin, shiga toxin A, anthrax toxin 
lethal factor, or ricin, in order to effect ablation of cells that bind or take up the proteins of 

20 the present invention. 

Fusion partners include, inter alia, myc, hemagglutinin (HA), GST, 
immunoglobulins, p-galactosidase, biotin trpE, protein A, (J-lactamase, a-amylase, 
maltose binding protein, alcohol dehydrogenase, polyhistidine (for example, six histidine 
at the amino and/or carboxyl terminus of the polypeptide), lacZ, green fluorescent protein 

25 (GFP), yeast a mating factor, GAL4 transcription activation or DNA binding domain, 
luciferase, and serum proteins such as ovalbumin, albumin and the constant domain of 
IgG. See, Ausubel (1992), supra and Ausubel (1999), supra. Fusion proteins may 
also contain sites for specific enzj'matic cleavage, such as a site that is recognized by 
enzymes such as Factor XIII, trypsin, pepsin, or any other enzyme known in the art. 

30 Fusion proteins will typically be made by either recombinant nucleic acid methods, as 
described above^ chemically synthesized using techniques well known in the art {e,g., a 
Merrifield synthesis), or produced by chemical cross-linking. 
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Another advantage of fusion proteins is that the epitope tag can be used to bind the 
fusion protein to a plate or column through an affinity linkage for screening binding 
proteins or other molecules that bind to the CSP. 

As further described below, the polypeptides of the present invention can readily 
5 be used as specific immunogens to raise antibodies that specifically recognize 
pol>'peptides of the present invention including CSPs and their allelic variants and 
homologues. The antibodies, in turn, can be used, inter alia, specifically to assay for the 
po!>'peptides of the present invention, particulariy CSPs, e.g. by ELISA for detection of 
protein fluid samples, such as serum, by immunohistochemistry or laser scanning 

10 cytometr}', for detection of protein in tissue samples, or by flow cytometry, for detection 
of intracellular protein in cell suspensions, for specific antibody-mediated isolation and/or 
purification of CSPs, as for example by immunoprecipitation, and for use as specific 
agonists or antagonists of CSPs. 

One may determine whether polypeptides of the present invention including CSPs, 

1 5 muteins, homologous proteins or allelic variants or fusion proteins of the present invention 
are functional by methods known in the art. For instance, residues that are tolerant of 
change while retaining function can be identified by altering the polypeptide at known 
residues using methods known in the art, such as alanine scanning mutagenesis, 
Cunningham et ah. Science 244(4908): 1081-5 (1989); transposon linker scanning 

20 mutagenesis, Chen et al . Gene 263(1 -2): 39-48 (200 1 ); combinations of homolog- and 
alanine-scanning mutagenesis, Jin et al, J, Mol Biol. 226(3): 851-65 (1992); and 
combinatorial alanine scanning, Weiss et al, Proc. Natl Acad. Sci USA 97(16): 8950-4 
(2000), followed by functional assay, Transposon linker scanning kits are available 
commercially (New England Biolabs, Beverly, MA, USA, catalog, no. E7-102S; 

25 EZ::TNTM In-Frame Linker Insertion Kit, catalogue no. EZI04KN, (Epicentre 
Technologies Corporation, Madison, WI, USA). 

Purification of the polj'peptides or fusion proteins of the present invention is well 
known and within the skill of one having ordinary skill in the art. See, e.g., Scopes, 
Protein Purification . 2d ed. (1987). Purification of recombinantly expressed polypeptides 

30 is described above. Purification of chemically-synthesized peptides can readily be 
effected, e.g.,byHPLC 

Accordingly, it is an aspect of the* present invention to provide the isolated 
pol>T)eptides or fusion proteins of the present invention in pure or substantially pure form 
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in the presence or absence of a stabilizing agent. Stabilizing agents include both 
proteinaceous and non-proteinaceous material and are well known in the art. Stabilizing 
agents, such as albumin and polyethylene glycol (PEG) are known and are commercially 
available. 

5 Although high levels of purity are preferred when the isolated polypeptide or 

fusion protein of the present invention are used as therapeutic agents, such as in vaccines 
and replacement therapy, the isolated polypeptides of the present invention are also useful 
at lower purit)'. For example, partially purified polypeptides of the present invention can 
be used as immunogens to raise antibodies in laboratory' animals. 

10 In a preferred embodiment, the purified and substantially purified polypeptides of 

the present invention are in compositions that lack detectable ampholytes, acrylamide 
monomers, bis-acr>'lamide monomers, and polyacr>iamide. 

The pohpeptides or fusion proteins of the present invention can usefully be 
attached to a substrate. The substrate can be porous or solid, planar or non-planar; the 

1 5 bond can be covalent or noncovalent. For example, the peptides of the invention may be 
stabilized by covalent linkage to albumin. See, U.S. Patent No. 5,876,969, the contents of 
which are hereby incorporated in its entirety. 

The polypeptides or fusion proteins of the present invention can also be usefully 
bound to a porous substrate, commonly a membrane, typically comprising nitrocellulose, 

20 polwinylidene fluoride (PVDF), or cationically derivatized, hydrophilic PVDF; so bound, 
the polypeptides or fusion proteins of the present invention can be used to detect and 
quantify antibodies, e,g. in serum, that bind specifically to the immobilized polypeptide or 
fusion protein of the present invention. 

As another example, the polypeptides or fusion proteins of the present invention 

25 can usefully be bound to a substantially nonporous substrate, such as plastic, to detect and 
quantify' antibodies, e.g. in serum, that bind specifically to the immobilized protein of the 
present invention. Such plastics include polymethylacry^ic, polyethylene, polypropylene, 
polyacrylate, polymethylmethacrylate, poly\'inylchloride, polytetrafluoroethylene, 
polystyrene, polycarbonate, polyacetal, polysulfone, celluloseacetate, cellulosenitrate, 

30 nitrocellulose, or mixtures thereof; when the assay is performed in a standard microtiter 
dish, the plastic is typically polystyrene. 

The polypeptides and fusion proteins of the present invention can also be attached 
to a substrate suitable for use as a surface enhanced laser desorption ionization source; so 
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attached, the pol>'peptide or fusion protein of the present invention is useful for binding 
and then detecting secondary proteins that bind with sufficient affinit)' or avidity to the 
surface-bound polypeptide or fusion protein to indicate biologic interaction there between. 
The polj^peptides or fusion proteins of the present invention can also be attached to a 
5 substrate suitable for use in surface plasmon resonance detection; so attached, the 
polypeptide or fusion protein of the present invention is useful for binding and then 
detecting secondary proteins that bind with sufficient affinity or avidity to the surface- 
bound pol\T?eptide or fusion protein to indicate biological interaction there between. 

Alternative Transcripts 

10 In another aspect, the present invention provides splice variants of genes and 

proteins encoded thereby. The identification of a novel splice variant which encodes an 
amino acid sequence with a novel region can be targeted for the generation of reagents for 
use in detection and/or tt-eatment of cancer. The novel amino acid sequence may lead to a 
unique protein structure, protein subcellular localization, biochemical processing or 

15 function of the splice variant. This information can be used to directly or indirectly 

facilitate the generation of additional or novel therapeutics or diagnostics. The nucleotide 
sequence in this novel splice variant can be used as a nucleic acid probe for the diagnosis 
and/or treatment of cancer. 

Specifically, the newly identified sequences may enable the production of new 

20 antibodies or compounds directed against the novel region for use as a therapeutic or 
diagnostic. Alternatively, the newly identified sequences may alter the biochemical or 
biological properties of the encoded protein in such a way as to enable the generation of 
improved or different therapeutics targeting this protein. 

Antibodies 

25 In another aspect, the invention provides antibodies, including fragments and 

derivatives thereof, that bind specifically to polypeptides encoded by the nucleic acid 
molecules of the invention. In a preferred embodiment, the antibodies are specific for a 
polypeptide that is a CSP, or a fragment, mutein, derivative, analog or fusion protein 
thereof. In a more preferred embodiment, the antibodies are specific for a polypeptide that 

30 comprises SEQ ID NO: 96-237, or a fragment, mutein, derivative, analog or fusion protein 
thereof. 
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The antibodies of the present invention can be specific for Hnear epitopes, 
discontinuous epitopes, or conformational epitopes of such proteins or protein fragments, 
either as present on the protein in its native conformation or, in some cases, as present on 
the proteins as denatured, as, e.g., by solubilization in SDS. New epitopes may also be 
5 due to a difference in post translational modifications (PTMs) in disease versus normal 
tissue. For example, a particular site on a CSP may be glycosylated in cancerous cells, but 
not glycosylated in normal cells or vice versa. In addition, alternative splice forms of a 
CSP may be indicative of cancer. Differential degradation of the C or N-terminus of a 
CSP may also be a marker or target for anticancer therapy. For example, a CSP may be 

10 N-terminal degraded in cancer cells exposing new epitopes to antibodies which may 
selectively bind for diagnostic or therapeutic uses. 

As is well known in the art, the degree to which an antibody can discriminate 
among molecular species in a mixture will depend, in part, upon the conformational 
relatedness of the species in the mixture; typically, the antibodies of the present invention 

1 5 will discriminate over adventitious binding to non-CSP polypeptides by at least t\^'0-fold, 
more tj^pically by at least 5-fold, typically by more than 10-fold, 25-fold, 50-fold, 75-fold, 
and often by more than 100- fold, and on occasion by more than 500-fold or 1000-fold. 
When used to detect the proteins or protein fragments of the present invention, the 
antibody of the present invention is sufficiently specific when it can be used to determine 

20 the presence of the polypeptide of the present invention in samples derived from human 
colon. 

Typically, the affinity or avidity of an antibody (or antibody multimer, as in the 
case of an IgM pentamer) of the present invention for a protein or protein fragment of the 
present invention will be at least about 1x10"^ molar (M), typically at least about 5x10"^ 
25 M, 1 x 10"'' M, with affinities and avidities of at least 1 x 10"^ M, 5 x 10"^ M, 1 x 10'^° M 
and up to 1 X 10'*^ M proving especially useful. 

The antibodies of the present invention can be naturally occurring forms, such as 
IgG, IgM, IgD, IgE, IgY, and IgA, from any avian, reptilian, or mammalian species. 

Human antibodies can, but will infrequently, be drawn, directly from human donors 
30 or human cells. In such case, antibodies to the pol>T)eptides of the present invention will 
typically have resulted from fortuitous immunization, such as autoimmune immunization, 
with the polypeptide of the present invention. Such antibodies will typically, but will not 
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invariably, be polyclonal. In addition, individual polyclonal antibodies may be isolated 
and cloned to generate monoclonals. 

Human antibodies are more frequently obtained using transgenic animals that 
express human immunoglobulin genes, which transgenic animals can be affirmatively 

5 immunized with the protein immunogen of the present invention. Human Ig-transgenic 
mice capable of producing human antibodies and methods of producing human antibodies 
therefrom upon specific immunization are described, inter alia, in U.S. Patent Nos. 
6,162,963; 6,150,584; 6,114,598; 6,075,181; 5,939,598; 5,877,397; 5,874,299; 5,814,318; 
5,789,650; 5,770,429; 5,661,016; 5,633,425; 5,625,126; 5,569,825; 5,545,807; 5,545,806, 

10 and 5,591,669, the disclosures of which are incorporated herein by reference in their 
entireties. Such antibodies are t>TDical]y monoclonal, and are typically produced using 
techniques developed for production of murine antibodies. 

Human antibodies are particularly useful, and often preferred, when the antibodies 
of the present invention are to be administered to human beings as in vivo diagnostic or 

15 therapeutic agents, since recipient immune response to the administered antibody will 
often be substantially less than that occasioned by administration of an antibody derived 
from another species, such as mouse. 

IgG, IgM, IgD, IgE, IgY, and IgA antibodies of the present invention are also 
usefully obtained from other species, including mammals such as rodents (typically 

20 mouse, but also rat, guinea pig, and hamster), lagomorphs (t>'pically rabbits), and also 
larger mammals, such as sheep, goats, cows, and horses; or egg laying birds or reptiles 
such as chickens or alligators. In such cases, as with the transgenic human-antibody- 
producing non-human mammals, fortuitous immunization is not required, and the non- 
human mammal is typically affirmatively immunized, according to standard immunization 

25 protocols, with the polypeptide of the present invention. One fonn of avian antibodies 
may be generated using techniques described in WO 00/29444, published 25 May 2000, 
which is herein incorporated by reference in its entirety. 

As discussed above, virtually all fragments of 8 or more contiguous amino acids of 
a polypeptide of the present invention can be used effectively as immunogens when 

30 conjugated to a carrier, typically a protein such as bovine thyroglobulin, keyhole limpet 
hemocyanin, or bovine serum albumin, conveniently using a bifunctional linker such as 
- those described elsev/here above, which discussion is incorporated by reference here. 
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Immunogenicity can also be conferred by fusion of the pol>'peptides of the present 
invention to other moieties. For example, polji^eptides of the present invention can be 
produced by solid phase sjnthesis on a branched polylysine core matrix; these multiple 
antigenic peptides (MAPs) provide high purity, increased avidity, accurate chemical 
5 definition and improved safety in vaccine development. Tam et al, Proc, Natl. Acad. ScL 
USA 85: 5409-5413 (1988); Posnett BioL Chenu 263: 1719-1725 (1988). 

Protocols for immunizing non-human mammals or avian species are well- 
established in the art. See Harlow et al (eds.), Using Antibodies: A Laborator\- Manual 
Cold Spring Harbor Laboratory' (1998); Coligan ei al, (eds.), Current Protocols in 
10 Immunology, John Wiley & Sons, Inc. (2001); Zola, Monoclonal Antibodies: Preparation 
and Use of Mo noclonal Antibodies and Engineered Antibodv Derivatives (Basics: From 
Background to BenchV Springer Verlag (2000); Gross M, Speck J.Dtsck Tierarztl 
Wochensch}\ 103: 417-422 (1996). Immunization protocols often include multiple 
immunizations, either with or without adjuvants such as Freund's complete adjuvant and 
15 Freund*s incomplete adjuvant, and may include naked DNA immunization. Moss, Semin, 
Immunol 2: 317-327(1990). 

Antibodies from non-human mammals and avian species can be polyclonal or 
monoclonal, with polyclonal antibodies having certain advantages in 
immunohistochemical detection of the pol^^peptides of the present invention and 
20 monoclonal antibodies having advantages in identifying and distinguishing particular 
epitopes of the polj^peptides of the present invention. Antibodies from avian species may 
have particular advantage in detection of the polipeptides of the present invention, in 
human serum or tissues. Vikinge et al., Biosens, Bioelectron, 13: 1257-1262 (1998), 
Following immunization, the antibodies of the present invention can be obtained using any 
25 art-accepted technique. Such techniques are well known in the art and are described in 
detail in references such as Coligan, supra; Zola, supra; Howard et al. (eds.), Basic 
Methods in Antibody Production and Characterization . CRC Press (2000); Harlow, sup?'a; 
Davis (ed.), Monoclonal Antibodv Protocols . Vol. 45, Humana Press (1995); Delves (ed.), 
Antibodv Production: Essential Techniques . John Wiley & Son Ltd (1997); and Kenney, 
30 Antibodv Solution: An Antibodv Methods Manual . Chapman & Hall (1997). 

Briefly, such techniques include, inter alia, production of monoclonal antibodies 
by hybridomas and expression of antibodies or fragments or derivatives thereof from host 
cells engineered to express immunoglobulin genes or fragments thereof. These two 



wo 2004/050860 



PCT/US2003/040063 



93 

methods of production are not mutually exclusive: genes encoding antibodies specific for 
the polypeptides of the present invention can be cloned from hybridomas and thereafter 
expressed in other host cells. Nor need the nvo necessarily be performed together: e.g., 
genes encoding antibodies specific for the pol>'peptides of the present invention can be 
5 cloned directly from B cells known to be specific for the desired protein, as further 

described in U.S. Patent No. 5,627,052, the disclosure of which is incorporated herein by 
reference in its entiret>% or from antibody-displaying phage. 

Recombinant expression in host cells is particularly useful when fragments or 
derivatives of the antibodies of the present invention are desired. 
10 Host cells for recombinant antibody production of whole antibodies, antibody 

fragments, or antibody derivatives can be prokaryotic or eukar>'0tic. 

Prokapy'otic hosts are particularly useful for producing phage displayed antibodies 
of the present invention. 

The technolog>' of phage-displayed antibodies, in which antibody variable region 
1 5 fragments are fused, for example, to the gene III protein (pIII) or gene VIII protein (pVIII) 
for display on the surface of filamentous phage, such as Ml 3, is by now well-established. 
See. e.g., Sidhu, Cwr. Opin. Biotechnol 11(6): 610-6 (2000); Griffiths etal, Cwr. Opin. 
Biotechnoi 9(1): 102-S (1998); Hoogenboom et alJmmwwtechnolog\>, 4(1): 1-20 (1998); 
Rader et al. Current Opinion in Biotechnology^ 8: 503-508 (1997); Aujame et al, Human 
20 Antibodies S: 155-168 (1997); Hoogenboom, Trends in Biotechnol. 15: 62-70 (1997); de 
Kiuif et aL, 17: 453-455 (1996); Barbas et al. Trends in Biotechnol 14: 230-234 (1996): 
Winter et al, Ann. Rev. Immunol, 433-455 (1994). Techniques and protocols required to 
generate, propagate, screen (pan), and use the antibody fragments from such libraries have 
recently been compiled. See, e.g., Barbas (2001), supra; Kay, supra; and Abelson, supra. 
25 Typically, phage-displayed antibody fragments are scFv fragments or Fab 

fragments; when desired, full length antibodies can be produced by cloning the variable 
regions from the displaying phage into a complete antibody and expressing the fill) length 
antibody in a further prokaryotic or a eukaryotic host cell Eukatyotic cells are also useful 
for expression of the antibodies, antibody fragments, and antibody derivatives of the 
30 present invention. For example, antibody fragments of the present invention can be 

produced in Pichia pastoris and in Saccharomyces cerevisiae. See, e.g., Takahashi et al^ 
BioscL.Biotedvwl.£iochem. 64(10): 2138-44 (2000); Freyre et al, J. Biotechnol. 
76(2-3):l 57-63 (2000); Fischer et ai, Biotechnoi. AppL Biochem, 30 (Pt 2): 1 17-20 
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(1999); Pennell et ah. Res. Immunol 149(6): 599-603 (199S); Eldin et alj. Immimol 
Methods. 201(1): 67-75 (1997);, Frenken et al, Res. Immunol 149(6): 589-99 (1998); and 
Shusta et al, Nature Biotechnol 16(8): 773-7 (1998). 

Antibodies, including antibody fragments and derivatives, of the present invention 
5 can also be produced in insect cells. See, e.g., Li et al, Protein Expr. Purif. 21(1): 121-8 
(2001); Ailor et al, Biotechnol Bioeng. 58(2-3): 196-203 (1998); Hsu et al, Biotechnol 
Prog. 13(1): 96-104 (1997); Edelman et al. Immunology^ 91(1): 13-9 (1997); and Nesbit et 
al, J. Immunol Methods 151(1-2): 201-8 (1992). 

Antibodies and fragments and derivatives thereof of the present invention can also 

10 be produced in plant cells, particularly maize or tobacco, Giddings et al , Nature 

Biotechnol 18(11): 1151-5 (2000)] Gsivilondoetal.Biotechniques 29(1): 128-38(2000); 
Fischer et al, J. Biol Regul Homeost. Agents 14(2): 83-92 (2000); Fischer et al, 
Biotechnol Appl Biochem. 30 (Pt 2): 113-6 (1999); Fischer et al.Biol Chem. 380(7-8): 
825-39 (1999); Russell, Curr. Top. Microbiol Immunol 240: 1 19-38 (1999); and Ma et 

15 al. Plant Physiol 1 09(2): 34 1 -6 ( 1 995). 

Antibodies, including antibody fragments and derivatives, of the present invention 
can also be produced in transgenic, non-human, mammalian milk. See, e.g. Pollock et al., 
J. Immunol Methods. 231: 147-57 (1999); Young etal.. Res. Immunol. 149:609-10 
(1998); and Limonta et al., Immunotechnologyf 1: 107-13(1995). 

20 Mammalian cells useful for recombinant expression of antibodies, antibody 

fragments, and antibody derivatives of the present invention include CHO cells, COS 
cells, 293 cells, and myeloma cells. Verma et al, J. Immunol Methods 2 16(1 -2): 165-81 

(1998) xtvitw and compare bacterial, yeast, insect and mammalian expression systems for 
expression of antibodies. Antibodies of the present invention can also be prepared by cell 

25 free translation, as further described in Merk et al, J. Biochem. (Tokyo) 125(2): 328-33 

(1999) and Ryabova et al. Nature Biotechnol 15(1): 79-84 (1997), and in the milk of 
transgenic animals, as further described in Pollock et al, J. Immunol Methods 231(1-2): 
147-57(1999). 

The invention further provides antibody fragments that bind specifically to one or 
30 more of the poI)peptides of the present invention or to one or more of the polji^eptides 
encoded by the isolated nucleic acid molecules of the present invention, or the binding of 
which cair be competitively inhibited by one or more of the polypeptides of the present 
invention or one or more of the polypeptides encoded by the isolated nucleic acid 
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molecules of the present invention. Among such useful fragments are Fab, Fab\ Fv, 
F(ab)'2j and single-chain Fv (scFv) fragments. Other useful fragments are described in 
Hudson, Cwr. Opuh BiotecIuwL 9(4): 395-402 (1998). 

The present invention also relates to antibody derivatives that bind specifically to 
5 one or more of the pol>'peptides of the present invention, to one or more of the 

polypeptides encoded by the isolated nucleic acid molecules of the present invention, or 
the binding of which can be competitively inhibited by one or more of the polypeptides of 
the present invention or one or more of the polypeptides encoded by the isolated nucleic 
acid molecules of the present invention. 
10 Among such useful derivatives are chimeric, primatized, and humanized 

antibodies; such derivatives are less immunogenic in human beings, and thus are more 
suitable for i?i vivo administration, than are unnioiJified antibodies from non-human 
mammalian species. Another useful method is PEGylation to increase the serum half life 
of the antibodies. 

1 5 Chimeric antibodies typically include hea\7 and/or light chain variable regions 

(including both CDR and framework residues) of immunoglobulins of one species, 
tjTDically mouse, fused to constant regions of another species, typically human. See. e.g., 
Morrison et al, Proc. Natl. Acad. Sci USA.&\i2l): 6S51-5 (1984); Sharon etai. Nature 
309(5966): 364-7 (1984); Takeda et al. Nature 314(6010): 452-4 (1985); and U.S. Patent 

20 No. 5,807,715 the disclosure of which is incorporated herein by reference in its entirety. 
Primatized and humanized antibodies typically include heav^ and/or light chain CDRs 
from a murine antibody grafted into a non-human primate or human antibody V region 
framework, usually further comprising a human constant region, Riechmann et al. Nature 
332(6162): 323-7 (1988); Co et al. Nature 351(6326): 501-2 (1991); and U.S. Patent Nos. 

25 6,054,297; 5,821,337; 5,770,196; 5,766,886; 5.821,123; 5,869,619; 6,180,377; 6,013,256; 
5,693,761; and 6,180,370, the disclosures of which are incorporated herein by reference in 
their entireties. Other useful antibody derivatives of the invention include heteromeric 
antibody complexes and antibody fusions, such as diabodies (bispecific antibodies), 
single-chain diabodies, and intrabodies. 

30 It is contemplated that the nucleic acids encoding the antibodies of the present 

invention can be operably joined to other nucleic acids forming a recombinant vector for 
cloning or for expression of the antibodies of the invention. Accordingly, the present 
invention includes any recombinant vector containing the coding sequences, or part 
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thereof, whether for eukatyotic transduction, transfection or gene therapy. Such vectors 
may be prepared using conventional molecular biology techniques, known to those with 
skill in the art, and would comprise DNA encoding sequences for the immunoglobulin V- 
regions including framework and CDRs or parts thereof, and a suitable promoter either 
5 with or without a signal sequence for intracellular transport. Such vectors may be 

transduced or transfected into eukaryotic cells or used for gene therapy (Marasco et al., 
Proc. Natl. Acad Sci. OJSA) 90: 7889-7893 (1993); Duan et al., Proc. Natl Acad, ScL 
(USA) 91 : 5075-5079 (1994), by conventional techniques, known to those with skill in 
the art, 

10 The antibodies of the present invention, including fragments and derivatives 

thereof, can useflilly be labeled. It is, therefore, another aspect of the present invention to 
provide labeled antibodies that bind specifically to one or more of the pol)'peptides of the 
present invention, to one or more of the pol^T^eptides encoded by the isolated nucleic acid 
molecules of the present invention, or the binding of which can be competitively inhibited 

15 by one or more of the pol^qDepiides of the present invention or one or more of the 

polypeptides encoded by the isolated nucleic acid molecules of the present invention. The 
choice of label depends, in part, upon the desired use. 

For example, when the antibodies of the present invention are used for 
immunohistochemical staining of tissue samples, the label can usefully be an enzyme that 

20 catalyzes production and local deposition of a detectable product. Enzymes t>pically 
conjugated to antibodies to pemiit their immunohistochemical visualization are well 
known, and include alkaline phosphatase, (i-galactosidase, glucose oxidase, horseradish 
peroxidase (HRP), and urease. Typical substrates for production and deposition of 
visually detectable products include o-nitrophenyl-beta-D-galactopyranoside (ONPG); 

25 o-phenylenediamine dihydrochloride (OPD); p-nitrophenyl phosphate (PNPP); p- 

nitrophenyl-beta-D-galactopryanoside (PNPG); 3',3'-diaminoben2idine (DAB); 3-amino- 
9-ethylcarbazole (AEC); 4-chloro-l-naphthol (CN); 

5-bromo-4-chloro-3-indolyl-phosphate (BCIP); ABTS®; BluoGal; iodonitrotetrazolium 
(INT); nitroblue tetrazolium chloride (NET); phenazine methosulfate (PMS); 
30 phenolphthalein monophosphate (PMP); tetramethyl benzidine (TMB); tetranitroblue 
tetrazolium (TNBT); X-Gal; X-Gluc; and X-Glucoside. 

Other substrates can be used to produce products for local deposition that are 
luminescent. For example, in the presence of hydrogen peroxide (H2O2), horseradish 
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peroxidase (HRP) can catalyze the oxidation of cyclic diacylhydrazides, such as luminol. 
Immediately following the oxidation, the luminol is in an excited state (intermediate 
reaction product), which decays to the ground state by emitting light. Strong enhancement 
of the light emission is produced by enhancers, such as phenolic compounds. Advantages 
5 include high sensitivit>-, high resolution, and rapid detection without radioactivity and 
requiring only small amounts of antibody. See, e.g., Thorpe et al, Methods Enzymiol 133: 
331-53 (1986); Kricka et ai, J. Immunoassay 17(1): 67-83 (1996); and Lundqvist et al, J- 
Biolumvt Chemilimiin. 10(6): 353-9 (1995). Kits for such enhanced chemiluminescent 
detection (ECL) are available commercially. The antibodies can also be labeled using 
10 colloidal gold. 

As another example, when the antibodies of the present invention are used, e.g., for 
flow cytometric detection, for scanning laser cytometric detection, or for fluorescent 
immunoassay, they can usefully be labeled with fluorophores. There are a wide varietj' of 
fluorophore labels that can usefully be attached to the antibodies of the present invention. 

1 5 For flow cytometric applications, both for extracellular detection and for intracellular 
detection, common useful fluorophores can be fluorescein isothiocyanate (FITC), 
allophycocyanin (APC), R-phycoerythrin (PE). peridinin chlorophyll protein (PerCP), 
Texas Red, Cy3, Cy5, fluorescence resonance energj' tandem fluorophores such as PerCP- 
Cy5.5, PE-Cy5, PE-Cy5.5, PE-Cy7, PE-Texas Red, and APC-Cy7. 

20 Other fluorophores include, inter alia, Alexa Fluor® 350, Alexa Fluor® 488, 

Alexa Fluor® 532, Alexa Fluor® 546, Alexa Fluor® 568, Alexa Fluor® 594, Alexa 
Fluor® 647 (monoclonal antibody labeling kits available from Molecular Probes, Inc., 
Eugene, OR, USA), BODIPY dyes, such as BODIPY 493/503, BODIPY FL, BODIPY 
R6G, BODIPY 530/550, BODIPY TMR, BODIPY 558/568, BODIPY 558/568, BODIPY 

25 564/570, BODIPY 576/589, BODIPY 581/591, BODIPY TR, BODIPY 630/650, 
BODIPY 650/665, Cascade Blue, Cascade Yellow, Dansyl, lissamine rhodamine B, 
Marina Blue, Oregon Green 488, Oregon Green 514, Pacific Blue, rhodamine 6G, 
rhodamine green, rhodamine red, tetramethylrhodamine, Texas Red (available firom 
Molecular Probes, Inc., Eugene, OR, USA), and Cy2, Cy3, Cy3.5, Cy5, Cy5.5, Cy7, all of 

30 which are also useful for fluorescently labeling the antibodies of the present invention. 
For secondary detection using labeled avidin, streptavidin, captavidin or neutravidin, the 
antibodies of the present invention can- usefully be labeled with biotin. 
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When the antibodies of the present invention are used, e.g., for western blotting 
applications, they can usefully be labeled with radioisotopes, such as ^^P, ^^P, ^^S, ^H, and 
'•^I. As another example, when the antibodies of the present invention are used for 
radioimmunotherapy, the label can usefully be ^^^Th, ^"Ac, "^Ac, '"^Ra, "'^Bi, "'^Pb, 
5 2'^Bi,2^^At,^^^Pb, »^-»0s, '««Re, '^^Re, ^"Sm, ''^^Tb, ^^^I, ">In, *°^Rh, ^^^Tc^W 
^^Sr,^^^^^Se,^^Cu,or^'Sc. 

As another example, when the antibodies of the present invention are to be used 
for in vivo diagnostic use, they can be rendered detectable by conjugation to MRI contrast 
agents, such as gadolinium diethylenetriaminepentaacetic acid (DTPA), Lauffer et al, 

10 Radiology^ 207(2): 529-38 (1998), or by radioisotopic labeling. 

As would be understood, use of the labels described above is not restricted to the 
application as for which they were mentioned. 

The antibodies of the present invention, including fragments and derivatives 
thereof, can also be conjugated to toxins, in order to target the toxin's ablative action to 

1 5 cells that display and/or express the polypeptides of the present invention. Commonly, the 
antibody in such immunotoxins is conjugated to Pseudomonas exotoxin A, diphtheria 
to.xin, Shiga toxin A, anthrax toxin lethal factor, or ricin. See Hall (ed.), Immunotoxin 
Methods and Protocols (Methods in Molecular Biology, vol. 166), Humana Press (2000): 
and Frankel et al (eds.), Clinical Applications of Immunotoxins . Springer- Verlag (1998), 

20 The antibodies of the present invention can usefully be attached to a substrate, and 

it is, therefore, another aspect of the invention to provide antibodies that bind specifically 
to one or more of the pol}peptides of the present invention, to one or more of the 
polypeptides encoded by the isolated nucleic acid molecules of the present invention, or 
the binding of which can be competitively inhibited by one or more of the polypeptides of 

25 the present invention or one or more of the polypeptides encoded by the isolated nucleic 
acid molecules of the present invention, attached to a substrate. Substrates can be porous 
or nonporous, planar or nonplanar. For example, the antibodies of the present invention 
can usefully be conjugated to filtration media, such as NHS-activated Sepharose or CNBr- 
activated Sepharose for purposes of immunoaffmity chromatography. For example, the 

30 antibodies of the present invention can usefully be attached to paramagnetic microspheres, 
typically by biotin-streptavidin interaction, which microsphere can then be used for 
isolation of cells that express or display the pohT^eptides of the present invention. As 
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another example, the antibodies of the present invention can usefully be attached to the 
surface of a microtiter plate for ELISA. 

As noted above, the antibodies of the present invention can be produced in 
prokaryotic and eukarj'Otic cells. It is, therefore, another aspect of the present invention to 
5 provide cells that express the antibodies of the present invention, including hybridoma 
cells, B cells, plasma cells, and host cells recombinantly modified to express the 
antibodies of the present invention. 

In yet a further aspect, the present invention provides aptamers evolved to bind 
specifically to one or more of the CSPs of the present invention or to polypeptides 
10 encoded by the CSNAs of the invention. 

In sum, one of skill in the art, provided with the teachings of this invention, has 
available a variet>' of methods which may be used to alter the biological properties of the 
antibodies of this invention including methods which would increase or decrease the 
stability or half-life, immunogenicit}', toxicitj', affinit>' or yield of a given antibody 
1 5 molecule, or to alter it in any other way that may render it more suitable for a particular 
application. 

Transgenic Animals and Cells 

In another aspect, the invention provides transgenic cells and non-human 
organisms comprising nucleic acid molecules of the invention. In a preferred 

20 embodiment, the transgenic cells and non-human organisms comprise a nucleic acid 

molecule encoding a CSP. In a preferred embodiment, the CSP comprises an amino acid 
sequence selected from SEQ ID NO: 96-237, or a fragment, mutein, homologous protein 
or allelic variant thereof In another preferred embodiment, the transgenic cells and non- 
human organism comprise a CSNA of the invention, preferably a CSNA comprising a 

25 nucleotide sequence selected from the group consisting of SEQ ID NO: 1-95, or a part, 
substantially similar nucleic acid molecule, allelic variant or hybridizing nucleic acid 
molecule thereof 

In another embodiment, the transgenic cells and non-human organisms have a 
targeted disruption or replacement of the endogenous orthologue of the human CSG. The 
30 transgenic cells can be embrj'onic stem cells or somatic cells. Tlie transgenic non-human 
organisms can be chimeric, nonchimeric heterozygotes, and nonchimeric homo2ygotes. 
Methods of producing transgenic animals are well known in the art. See, e.g., Hogan et 
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al. Manipulating the Mouse Embryo: A Laborator\^ Manual . 2d ed.. Cold Spring Harbor 
Press (1999); Jackson et ai. Mouse Genetics and Transgenics: A Practical Approach ^ 
Oxford Universit}' Press (2000); and Pinkert, Transgenic Animal Technology^: A 
Laborator\^ Handbook . Academic Press (1999). 
5 Any technique known in the art may be used to introduce a nucleic acid molecule 

of the invention into an animal to produce the founder lines of transgenic animals. Such 
techniques include, but are not limited to, pronuclear microinjection, (see, e.g., Paterson 
et al, Appl Microbiol Biotechnol 40: 691-698 (1994); Car\'er et al, Biotechnology 1 1 : 
1263-1270 (1993); Wright et al, Bioteclviology 9: 830-S34 (1991); and U.S. Patent No. 

10 4,873,191 , herein incorporated by reference in its emiret\'); retrovirus-mediated gene 
transfer into germ lines, blastocysts or embryos (see, e.g., Van der Putten et al.Proc. 
Nati Acad, Sci., USA 82: 6148-6152 (1985)); gene targeting in embr>'onic stem cells (see, 
e.g., Thompson et ah, Cell 56: 313-321 (1989)); electroporation of cells or embryos {see, 
e.g., Lo, 19S3, Mol. Cell. Biol 3: 1803-1814 (1983)); introduction using a gene gun {see, 

15 e.g., Ulmer et a/., Science 259: 1745-49 (1993); introducing nucleic acid constructs into 
embryonic pleuripotent stem cells and transferring the stem cells back into the blastocyst; 
and sperm-mediated gene transfer {see, e.g., Lavitrano et al. Cell 57: 717-723 (1989)). 

Other techniques include, for example, nuclear transfer into enucleated oocytes of 
nuclei from cultured embryonic, fetal, or adult cells induced to quiescence {see, e.g., 

20 Campell et al, Nature 380: 64-66 (1996); Wilmut et ai, Nature 385: 810-813 (1997)). 
The present invention provides for transgenic animals that carry the transgene (/.e., a 
nucleic acid molecule of the invention) in all their cells, as well as animals which carry the 
transgene in some, but not all their cells, i.e. e., mosaic animals or chimeric animals. 

The transgene may be integrated as a single transgene or as multiple copies, such 

25 as in concatamers, e. g., head-to-head tandems or head-to-tail tandems. The transgene 
may also be selectively introduced into and activated in a particular cell type by following, 
e.g., the teaching of Lasko et al et al, Proc, Natl Acad. ScL USA 89: 6232- 6236 (1992). 
The regulatory sequences required for such a cell-type specific activation will depend 
upon the particular cell type of interest, and will be apparent to those of skill in the art. 

30 Once transgenic animals have been generated, the expression of the recombinant 

gene may be assayed utilizing standard techniques. Initial screening may be accomplished 
by Southern blo.t analysis or PGR techniques to analyze animal tissues to verify that 
integration of the transgene has taken place. The level of mRNA expression of the 
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transgene in the tissues of the transgenic animals may also be assessed using techniques 
which include, but are not limited to, Northern blot analysis of tissue samples obtained 
from the animal in situ hybridization analysis, and reverse transcriptase-PCR (RT-PCR). 
Samples of transgenic gene-expressing tissue may also be evaluated 
5 immunocytochemically or immunohistochemically using antibodies specific for the 
transgene product. 

Once the founder animals are produced, they may be bred, inbred, outbred, or 
crossbred to produce colonies of the particular animal. Examples of such breeding 
strategies include, but are not limited to: outbreeding of founder animals with more than 

10 one integration site in order to establish separate lines; inbreeding of separate lines in 

order to produce compound transgenics that express the transgene at higher levels because 
of the effects of additive expression of each transgene; crossing of heterozygous 
transgenic animals to produce animals homozygous for a given integration site in order to 
both augment expression and eliminate the need for screening of animals by DNA 

15 analysis; crossing of separate homozj'gous lines to produce compound heterozygous or 
homozygous lines; and breeding to place the transgene on a distinct background that is 
appropriate for an experimental model of interest 

Transgenic animals of the invention have uses which include, but are not limited 
to, animal model systems useful in elaborating the biological function of poljTDeptides of 

20 the present invention, studying conditions and/or disorders associated with aberrant 
expression, and in screening for compounds effective in ameliorating such conditions 
and/or disorders. 

Methods for creating a transgenic animal with a disruption of a targeted gene are 
also well known in the art. In general, a vector is designed to comprise some nucleotide 

25 sequences homologous to the endogenous targeted gene. The vector is introduced into a 
cell so that it may integrate, via homologous recombination with chromosomal sequences, 
into the endogenous gene, thereby disrupting the function of the endogenous gene. The 
transgene may also be selectively introduced into a particular cell t>pe, thus inactivating 
the endogenous gene in only that cell type. See, e,g„ Gu et al. Science 265: 103-106 

30 (1994). The regulator^' sequences required for such a cell-type specific inactivation will 
depend upon the particular cell type of interest, and will be apparent to those of skill in the 
art. See, e.g., .Smithies et al,.Nature3\l: 230-234 (19S5); Thomas et ai. Cell 51: 503- 
512(1987); Thompson etaL,Cell5: 313-321 (1989). 
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In one embodiment, a mutant, non-functional nucleic acid molecule of the 
invention (or a completely unrelated DNA sequence) flanked by DNA homologous to the 
endogenous nucleic acid sequence (either the coding regions or regulatory regions of the 
gene) can be used, with or without a selectable marker and/or a negative selectable 
5 marker, to transfect cells that express polypeptides of the invention in vivo. In another 
embodiment, techniques known in the art are used to generate knockouts in cells that 
contain, but do not express the gene of interest. Insertion of the DNA construct, via 
targeted homologous recombination, results in inactivation of the targeted gene. Such 
approaches are particularly suited in research and agricultural fields where modifications 

10 to embryonic stem cells can be used to generate animal offspring with an inactive targeted 
gene. See, e.g., Thomas, supra and Thompson, supra. However this approach can be 
routinely adapted for use in humans provided the recombinant DNA constructs are directly 
administered or targeted to the required site in vivo using appropriate viral vectors that 
will be apparent to those of skill in the art. 

1 5 In further embodiments of the invention, cells that are genetically engineered to 

express the poljpeptides of the invention, or alternatively, that are genetically engineered 
not to express the polypeptides of the invention {e.g., knockouts) are administered to a 
patient in vivo. Such cells may be obtained from an animal or patient or an MHC 
compatible donor and can include, but are not limited to fibroblasts, bone marrow cells, 

20 blood cells (e.g., lymphoc>tes), adipoc>tes, muscle cells, endothelial cells etc. The cells 
are genetically engineered in vitro using recombinant DNA techniques to introduce the * 
coding sequence of polypeptides of the invention into the cells, or alternatively, to disrupt 
the coding sequence and/or endogenous regulatory sequence associated with the 
polypeptides of the invention, e.g., by transduction (using viral vectors, and preferably 

25 vectors that integrate the transgene into the cell genome) or transfection procedures, 
including, but not limited to, the use of plasmids, cosmids, YACs, naked DNA, 
electroporation, liposomes, etc. 

The coding sequence of the polypeptides of the invention can be placed under the 
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve 

30 expression, and preferably secretion, of the polypeptides of the invention. The engineered 
cells which express and preferably secrete the polypeptides of the invention can be 
introduced into the patient systemically, e.g., in the circulation, or intraperitoneally. 
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Alternatively, the cells can be incorporated into a matrix and implanted in the 
body, e.g., genetically engineered fibroblasts can be implanted as part of a skin graft; 
genetically engineered endothelial cells can be implanted as part of a lymphatic or 
vascular graft. See, e.g., U.S. Patent Nos. 5,399,349 and 5,460,959, each of which is 
5 incorporated by reference herein in its entirety. 

When the cells to be administered are non-autologous or non-MHC compatible 
cells, they can be administered using well known techniques which prevent the 
development of a host immune response against the introduced cells. For example, the 
cells may be introduced in an encapsulated form which, while allowing for an exchange of 
10 components with the immediate extracellular environment, does not allow the introduced 
cells to be recognized by the host immune system. 

Transgenic and "knock-out" animals of the invention have uses which include, but 
are not limited to, animal model systems useful in elaborating the biological function of 
poh^pepiides of the present invention, studying conditions and/or disorders associated with 
15 aberrant expression, and in screening for compounds effective in ameliorating such 
conditions and/or disorders. 

Computer Readable Means 

A further aspect of the invention is a computer readable means for storing the 
nucleic acid and amino acid sequences of the instant invention. In a preferred 

20 embodiment, the invention provides a computer readable means for storing SEQ ID NO: 
96-237 and SEQ ID NO: 1-95 as described herein, as the complete set of sequences or in 
any combination. The records of the computer readable means can be accessed for 
reading and display and for interface with a computer system for the application of 
programs allowing for the location of data upon a query for data meeting certain criteria, 

25 the comparison of sequences, the alignment or ordering of sequences meeting a set of 
criteria, and the like. 

The nucleic acid and amino acid sequences of the invention are particularly useful 
as components in databases useful for search analyses as well as in sequence analysis 
algorithms. As used herein, the terms "nucleic acid sequences of the invention" and 

30 "amino acid sequences of the invention" mean any detectable chemical or physical 

characteristic of a polynucleotide or polypeptide of the invention that is or may be reduced 
to or stored in a computer readable form. These include, without limitation, 
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chromatographic scan data or peak data, photographic data or scan data therefrom, and 
mass spectrographic data. 

This invention provides computer readable media having stored thereon sequences 
of the invention. A computer readable medium may comprise one or more of the 
5 following: a nucleic acid sequence comprising a sequence of a nucleic acid sequence of 
the invention; an amino acid sequence comprising an amino acid sequence of the 
invention; a set of nucleic acid sequences wherein at least one of said sequences comprises 
the sequence of a nucleic acid sequence of the invention; a set of amino acid sequences 
wherein at least one of said sequences comprises the sequence of an amino acid sequence 

10 of the invention; a data set representing a nucleic acid sequence comprising the sequence 
of one or more nucleic acid sequences of the invention; a data set representing a nucleic 
acid sequence encoding an amino acid sequence comprising the sequence of an amino acid 
sequence of the invention; a set of nucleic acid sequences wherein at least one of said 
sequences comprises the sequence of a nucleic acid sequence of the invention; a set of 

1 5 amino acid sequences wherein at least one of said sequences comprises the sequence of an 
amino acid sequence of the invention; a data set representing a nucleic acid sequence 
comprising the sequence of a nucleic acid sequence of the invention; a data set 
representing a nucleic acid sequence encoding an amino acid sequence comprising the 
sequence of an amino acid sequence of the invention. The computer readable medium can 

20 be any composition of matter used to store information or data, including, for example, 
commercially available floppy disks, tapes, hard drives, compact disks, and video disks. 

Also provided by the invention are methods for the analysis of character 
sequences, particularly genetic sequences. Preferred methods of sequence analysis 
include, for example, methods of sequence homology analysis, such as identity and 

25 similarity analysis, RNA structure analysis, sequence assembly, cladistic analysis, 

sequence motif analysis, open reading frame determination, nucleic acid base calling, and 
sequencing chromatogram peak analysis. 

A computer-based method is provided for performing nucleic acid sequence 
identity or similaritj' identification. This method comprises the steps of providing a 

30 nucleic acid sequence comprising the sequence of a nucleic acid of the invention in a 
computer readable medium; and comparing said nucleic acid sequence to at least one 
nucleic acid or amino acid sequence* to identify sequence identity or similarity. 
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A computer-based method is also provided for performing amino acid homolog}' 
identification, said method comprising the steps of: providing an amino acid sequence 
comprising the sequence of an amino acid of the invention in a computer readable 
medium; and comparing said amino acid sequence to at least one nucleic acid or an amino 
5 acid sequence to identify homology'. 

A computer-based method is still further provided for assembly of overlapping 
nucleic acid sequences into a single nucleic acid sequence., said method comprising the 
steps of: providing a first nucleic acid sequence comprising the sequence of a nucleic acid 
of the invention in a computer readable medium; and screening for at least one 
10 overlapping region between said first nucleic acid sequence and a second nucleic acid 
sequence. In addition, the invention includes a method of using patterns of expression 
associated with either the nucleic acids or proteins in a computer-based method to 
diagnose disease. 

Diagnostic Methods for Colon Cancer 

1 5 The present invention also relates to quantitative and qualitative diagnostic assays 

and methods for detecting, diagnosing, monitoring, staging and predicting cancers by 
comparing expression of a CSNA or a CSP in a human patient that has or may have colon 
cancer, or who is at risk of developing colon cancer, with the expression of a CSNA or a 
CSP in a normal human control. For purposes of the present invention, "expression of a 

20 CSNA" or "CSNA expression" means the quantity of CSNA mRNA that can be measured 
by any method known in the art or the level of transcription that can be measured by any 
method known in the art in a cell, tissue, organ or whole patient. Similarly, the term 
"expression of a CSP" or "CSP expression" means the amount of CSP that can be 
measured by any method known in the art or the level of translation of a CSNA that can be 

25 measured by any method known in the art. 

The present invention provides methods for diagnosing colon cancer in a patient, 
in particular adenocarcinoma, by analyzing for changes in levels of CSNA or CSP in cells, 
tissues, organs or bodily fluids compared with levels of CSNA or CSP in cells, tissues, 
organs or bodily fluids of preferably the same type from a normal human control, wherein 

30 an increase, or decrease in certain cases, in levels of a CSNA or CSP in the patient versus 
the normal human control is associated with the presence of colon cancer or with a 
predilection to the disease. In another preferred embodiment, the present invention 
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provides methods for diagnosing colon cancer in a patient by analyzing changes in the 
structure of the mRNA of a CSG compared to the mRNA from a normal control. These 
changes include, without limitation, aberrant splicing, alterations in polyadenylation 
and/or alterations in 5* nucleotide capping. In yet another preferred embodiment, the 
5 present invention provides methods for diagnosing colon cancer in a patient by analyzing 
changes in a CSP compared to a CSP from a normal patient. These changes include, e.g., 
alterations, including post translational modifications such as glycosylation and/or 
phosphorylation of the CSP or changes in the subcellular CSP localization. 

For purposes of the present invention, diagnosing means that CSNA or CSP levels 

10 are used to determine the presence or absence of disease in a patient. As will be 

understood by those of skill in the art, measurement of other diagnostic parameters may be 
required for definitive diagnosis or determination of the appropriate treatment for the 
disease. The determination may be made by a clinician, a doctor, a testing laboratory, or a 
patient using an over the counter test. The patient may have symptoms of disease or may 

1 5 be asymptomatic. In addition, the CSNA or CSP levels of the present invention may be 
used as screening marker to determine whether further tests or biopsies are warranted. In 
addition, the CSNA or CSP levels may be used to determine the Milnerability or 
susceptibilitj' to disease. 

In a prefeiTed embodiment, the expression of a CSNA is measured by determining 

20 the amount of a mRNA that encodes an amino acid sequence selected from SEQ ID NO: 
96-237, a homolog, an allelic variant, or a fragment thereof In a more preferred 
embodiment, the CSNA expression that is measured is the level of expression of a CSNA 
mRNA selected from SEQ ID NO: 1-95, or a hybridizing nucleic acid, homologous 
nucleic acid or allelic variant thereof, or a part of any of these nucleic acid molecules. 

25 CSNA expression may be measured by any method known in the art, such as those 

described supra, including measuring mRNA expression by Northern blot, quantitative or 
qualitative reverse transcriptase PCR (RT-PCR), microarray, dot or slot blots or in situ 
hybridization. See, e.g., Ausubel (1992), supra: Ausubel (1999), supra; Sambrook 
(1989), supra; and Sambrook (2001), supra. CSNA transcription may be measured by any 

30 method known in the art including using a reporter gene hooked up to the promoter of a 
CSG of interest or doing nuclear run-off assays. Alterations in mRNA structure, e.g., 
-aberrant splicing variants,-may be determined by any method known in the art, including, 
RT-PCR followed by sequencing or restriction analysis. As necessary, CSNA expression 
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may be compared to a known control, such as normal colon nucleic acid, to detect a 
change in expression. 

In another preferred embodiment, the expression of a CSP is measured by 
determining the level of a CSP having an amino acid sequence selected from the group 
5 consisting of SEQ ID NO: 96-237, a homolog, an allelic variant, or a fragment thereof. 
Such levels are preferably determined in at least one of cells, tissues, organs and/or bodily 
fluids, including determination of normal and abnormal levels. Thus, for instance, a 
diagnostic assay in accordance with the invention for diagnosing over- or underexpression 
of a CSNA or CSP compared to normal control bodily fluids, cells, or tissue samples may 
10 be used to diagnose the presence of colon cancer. The expression level of a CSP may be 
determined by any method known in the art, such as those described supra. In a preferred 
embodiment, the CSP expression level may be determined by radioimmunoassays, 
competitive-binding assays, ELISA, Western blot, FACS, immunohistochemistiy, 
immunoprecipitation, proteomic approaches: two-dimensional gel electrophoresis (2D 
15 electrophoresis) and non-gel-based approaches such as mass spectrometry or protein 

interaction profiling. See, e.g, Harlow (1999), supra\ Ausubel (1992), supra\ and Ausubel 
(1999), supra. Alterations in the CSP structure may be determined by any method known 
in the art, including, eg,, using antibodies that specifically recognize phosphoserine, 
phosphothreonine or phosphotyrosine residues, two-dimensional polyacrylamide gel 
electrophoresis (2D PAGE) and/or chemical analysis of amino acid residues of the protein. 
Id, 

In a preferred embodiment, a radioimmunoassay (RIA) or an ELISA is used. An 
antibody specific to a CSP is prepared if one is not already available. In a preferred 
embodiment, the antibody is a monoclonal antibody. The anti-CSP antibody is bound to a 
solid support and any free protein binding sites on the solid support are blocked with a 
protein such as bovine serum albumin. A sample of interest is incubated with flie antibody 
on the solid support under conditions in which the CSP will bind to the anti-CSP antibody. 
The sample is removed, the solid support is washed to remove unbound material, and an 
anti-CSP antibody that is linked to a detectable reagent (a radioactive substance for RIA 
and an enzyme for ELISA) is added to the solid support and incubated under conditions in 
which binding of the CSP to the labeled antibody will occur. After binding, the unbound 
labeled, antibody ia removed by washing. For an ELISA, one oi; more substrates, are added 
to produce a colored reaction product that is based upon the amount of a CSP in the 



wo 2004/050860 



PCT/US2003/040063 



108 

sample. For an RIA, the solid support is counted for radioactive decay signals by any 
method known in the art. Quantitative results for both RIA and ELISA typically are 
obtained by reference to a standard curve. 

Other methods to measure CSP levels are known in the art. For instance, a 
5 competition assay may be employed wherein an anti-CSP antibody is attached to a solid 
support and an allocated amount of a labeled CSP and a sample of interest are incubated 
with the solid support. The amount of labeled CSP attached to the solid support can be 
correlated to the quantity of a CSP in the sample. 

Of the proteomic approaches, 2D PAGE is a well known technique. Isolation of 

10 individual proteins from a sample such as serum is accomplished using sequential 

separation of proteins by isoelectric point and molecular weight. Typically, polypeptides 
are first separated by isoelectric point (the first dimension) and then separated by size 
using an electric current (the second dimension). In general, the second dimension is 
perpendicular to the first dimension. Because no two proteins with different sequences are 

15 identical on the basis of both size and charge, the result of 2D PAGE is a roughly square 
gel in which each protein occupies a unique spot. Analysis of the spots with chemical or 
antibody probes, or subsequent protein microsequencing can reveal the relative abundance 
of a given protein and the identity of the proteins in the sample. 

Expression levels of a CSNA can be determined by any method known in the art, 

20 including PCR and other nucleic acid methods, such as ligase chain reaction (LCR) and 
nucleic acid sequence based amplification (NASBA), can be used to detect malignant cells 
for diagnosis and monitoring of various malignancies. For example, reverse-transciiptase 
PCR (RT-PCR) is a powerful technique which can be used to detect the presence of a 
specific mRNA population in a complex mixture of thousands of other mRNA species. In 

25 RT-PCR, an mRNA species is first reverse transcribed to complementary DNA (cDNA) 
with use of the enzyme reverse transcriptase; the cDNA is then amplified as in a standard 
PCR reaction. 

Hybridization to specific DNA molecules oligonucleotides) arrayed on a 
solid support can be used to both detect the expression of and quantitate the level of 
30 expression of one or more CSNAs of interest. In this approach, all or a portion of one or 
more CSNAs is fixed to a substrate. A sample of interest, which may comprise RNA, e.g., 
•total RNA or polyA-selected mRNA, or a complementary DNA (cDNA) copy of the RNA 
is incubated with the solid support under conditions in which hybridization \vill occur 
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between the DNA on the solid support and the nucleic acid molecules in the sample of 
interest. Hybridization between the substrate-bound DNA and the nucleic acid molecules 
in the sample can be detected and quantitated by several means, including, without 
limitation, radioactive labeling or fluorescent labeling of the nucleic acid molecule or a 
5 secondary molecule designed to detect the hybrid. 

The above tests can be carried out on samples derived from a variety of cells, 
bodily fluids and/or tissue extracts such as homogenates or solubilized tissue obtained 
from a patient. Tissue extracts are obtained routinely from tissue biopsy and autopsy 
material. Bodily fluids useful in the present invention include blood, urine, saliva or any 

10 other bodily secretion or derivative thereof. As used herein "blood" includes whole blood, 
plasma, serum, circulating epithehal cells, constituents, or any derivative of blood. 

In addition to detection in bodily fluids, the proteins and nucleic acids of the 
invention are suitable to detection by cell capture technology. Whole cells may be 
captured by a variety methods for example magnetic separation, such as described in U.S. 

15 Patent. Nos. 5,200,084; 5,186,827; 5,108,933; and 4,925,788, the disclosures of which are 
incorporated herein by reference in their entireties. Epithelial cells may be captured using 
such products as Dynabeads® or CELLection™ (Dynal Biotech, Oslo, Norway). 
Alternatively, fractions of blood maybe captured, e.g., the buffy coat fraction (50mm cells 
isolated from 5ml of blood) containing epithelial cells. In addition, cancer cells may be 

20 captured using the techniques described in WO 00/47998, the disclosure of which is 
incorporated herein by reference in its entirety. Once the cells are captured or 
concentrated, the proteins or nucleic acids are detected by the means described in the 
subject application. Alternatively, nucleic acids may be captured directly from blood 
samples, see U.S. Patent Nos. 6,156,504, 5,501,963; or WO 01/42504 , the disclosures of 

25 which are incorporated herein by reference in their entireties. 

In a preferred embodiment, the specimen tested for expression of CSNA or CSP 
includes without limitation colon tissue, fecal samples, colonocytes, colon cells grown in 
cell culture, blood, serum, lymph node tissue, and lymphatic fluid In another preferred 
embodiment, especially when metastasis of a primary colon cancer is known or suspected, 

30 specimens include, without limitation, tissues from brain, bone, bone marrow, liver, lungs, 
and adrenal glands. In general, the tissues may be sampled by biopsy, including, without 
limitation, needje biopsy, e.g., transthoracic needle aspiration, cervical mediatinoscopy. 
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endoscopic lymph node biopsy, video-assisted thoracoscopy, exploratory thoracotomy, 
bone marrow biopsy and bone marrow aspiration. 

Colonocytes represent an important source of the CSP or CSNA because they 
provide a picture of the immediate past metabolic history of the GI tract of a subject. In 
5 addition, such cells are representative of the cell population from a statistically large 
sampling frame reflecting the state of the colonic mucosa along the entire length of the 
colon in a non-invasive manner, in contrast to a limited sampling by colonic biopsy using 
an invasive procedure involving endoscopy. Specific examples of patents describing the 
isolation of colonocytes include U.S. Patent Nos. 6,335,193; 6,020,137 5,741,650; 

10 6,258,541; US 2001 0026925 Al; WO 00/63358 Al, the disclosures of which are 
incorporated herein by reference in their entireties. 

All the methods of the present invention may optionally include determining the 
expression levels of one or more other cancer markers in addition to determining the 
expression level of a CSNA or CSP. In many cases, the use of another cancer marker will 

15 decrease the likelihood of false positives or false negatives. In one embodiment, the one 
or more other cancer markers include other CSNAs or CSPs as disclosed herein. Other 
cancer markers useful in the present invention will depend on the cancer being tested and 
are known to those of skill in the art. In a preferred embodiment, at least one other cancer 
marker in addition to a particular CSNA or CSP is measured. In a more preferred 

20 embodiment, at least tv^'o other additional cancer markers are used. In an even more 

preferred embodiment, at least three, more preferably at least five, even more preferably at 
least ten additional cancer markers are used. 

Diagnosing 

25 In one aspect, the invention provides a method for determining the expression 

levels and/or structural alterations of one or more CSNA and/or CSP in a sample from a 
patient suspected of having colon cancer. In general, the method comprises the steps of 
obtaining the sample from the patient, determining the expression level or structural 
alterations of a CSNA and/or CSP and then ascertaining whether the patient has colon 

30 cancer from the expression level of the CSNA or CSP. In general, if high expression 

relative to a control of a CSNA or CSP is indicative of colon cancer, a diagnostic assay is 
considered positive if the level of expression of the CSNA or CSP is at least one and a half 
times higher, and more preferably are at least two times higher, still more preferably five 
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times higher, even more preferably at least ten times higher, than in preferably the same 
cells, tissues or bodily fluid of a normal human control. In contrast, if low expression 
relative to a control of a CSNA or CSP is indicative of colon cancer, a diagnostic assay is 
considered positive if the level of expression of the CSNA or CSP is at least one and a half 
5 times lower, and more preferably are at least two times lower, still more preferably five 
times lower, even more preferably at least ten times lower than in preferably the same 
cells, tissues or bodily fluid of a normal human control. The normal human control may 
be from a different patient or from uninvolved tissue of the same patient. 

The present invention also provides a method of determining whether colon cancer 
1 0 has metastasized in a patient. One may identify whether the colon cancer has metastasized 
by measuring the expression levels and/or structural alterations of one or more CSNAs 
and/or CSPs in a variety of tissues. The presence of a CSNA or CSP in a tissue other than 
colon at levels higher than that of corresponding noncancerous tissue the same tissue 
from another individual) is indicative of metastasis if high level expression of a CSNA or 

1 5 CSP is associated with colon cancer. Similarly, the presence of a CSNA or CSP in a 
tissue other than colon at levels lower than that of corresponding noncancerous tissue is 
indicative of metastasis if low level expression of a CSNA or CSP is associated with colon 
cancer. Further, the presence of a structurally altered CSNA or CSP that is associated 
with colon cancer is also indicative of metastasis. 

20 In general, if high expression relative to a control of a CSNA or CSP is indicative 

of metastasis, an assay for metastasis is considered positive if flie level of expression of 
the CSNA or CSP is at least one and a half times higher, and more preferably are at least 
two times higher, still more preferably five times higher, even more preferably at least ten 
times higher, than in preferably the same cells, tissues or bodily fluid of a normal human 

25 control. In contrast, if low expression relative to a control of a CSNA or CSP is indicative 
of metastasis, an assay for metastasis is considered positive if the level of expression of 
the CSNA or CSP is at least one and a half times lower, and more preferably are at least 
two times lower, still more preferably five times lower, even more preferably at least ten 
times lower than in preferably the same cells, tissues or bodily fluid of a normal human 

30 control. 



Staging 
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The invention also provides a method of staging colon cancer in a human patient. 
The method comprises identifying a human patient having colon cancer and analyzing 
cells, tissues or bodily fluids from such human patient for expression levels and/or 
structural alterations of one or more CSNAs or CSPs. First, one or more tumors from a 
5 variety of patients are staged according to procedures well known in the art, and the 

expression levels of one or more CSNAs or CSPs is determined for each stage to obtain a 
standard expression level for each CSNA and CSP. Then, the CSNA or CSP expression 
levels of the CSNA or CSP are determined in a biological sample from a patient whose 
stage of cancer is not known. The CSNA or CSP expression levels from the patient are 
10 then compared to the standard expression level. By comparing the expression level of the 
CSNAs and CSPs from the patient to the standard expression levels, one may determine 
the stage of the tumor. The same procedure may be followed using structural alterations 
of a CSNA or CSP to determine the stage of a colon cancer. 

Momtoring 

1 5 Further provided is a method of monitoring colon cancer in a human patient. One 

may monitor a human patient to determine whether there has been metastasis and, if there 
has been, when metastasis began to occur. One may also monitor a human patient to 
determine whether a preneoplastic lesion has become cancerous. One may also monitor a 
human patient to determine whether a therapy, e,g,, chemotherapy, radiotherapy or 

20 surgery, has decreased or eliminated the colon cancer. The monitoring may detemine if 
there has been a reoccurrence and, if so, determine its nature. The method comprises 
identifying a human patient that one wants to monitor for colon cancer, periodically 
analyzing cells, tissues or bodily fluids from such human patient for expression levels of 
one or more CSNAs or CSPs, and comparing the CSNA or CSP levels over time to those 

25 CSNA or CSP expression levels obtained previously. Patients may also be monitored by 
measuring one or more structural alterations in a CSNA or CSP that are associated with 
colon cancer. 

If increased expression of a CSNA or CSP is associated with metastasis, treatment 
failure, or conversion of a preneoplastic lesion to a cancerous lesion, then detecting an 
30 increase in the expression level of a CSNA or CSP indicates that the tumor is 

metastasizing, that treatment has failed or that the lesion is cancerous, respectively. One 
having ordinary skill in the art would recognize that if this were the case, then a decreased 
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expression level would be indicative of no metastasis, effective therapy or failure to 
progress to a neoplastic lesion. If decreased expression of a CSNA or CSP is associated 
with metastasis, treatment failure, or conversion of a preneoplastic lesion to a cancerous 
lesion, then detecting a decrease in the expression level of a CSNA or CSP indicates that 
5 the tumor is metastasizing, that treatment has failed or that the lesion is cancerous, 
respectively. In a preferred embodiment, the levels of CSNAs or CSPs are determined 
from the same cell type, tissue or bodily fluid as prior patient samples. Monitoring a 
patient for onset of colon cancer metastasis is periodic and preferably is done on a 
quarterly basis, but may be done more or less frequently. 

10 The methods described herein can further be utilized as prognostic assays to 

identify subjects having or at risk of developing a disease or disorder associated with 
increased or decreased expression levels of a CSNA and/or CSP. The present invention 
provides a method in which a test sample is obtained from a human patient and one or 
more CSNAs and/or CSPs are detected. The presence of higher (or lower) CSNA or CSP 

15 levels as compared to normal human controls is diagnostic for the human patient being at 
risk for developing cancer, particularly colon cancer. The effectiveness of therapeutic 
agents to decrease (or increase) expression or activity of one or more CSNAs and/or CSPs 
of the invention can also be monitored by analyzing levels of expression of the CSNAs 
and/or CSPs in a human patient in clinical trials or in in vitro screening assays such as in 

20 human cells. In this way, the gene expression pattern can serve as a marker, indicative of 
the physiological response of the human patient or cells, as the case may be, to the agent 
being tested. 

Detection of Genetic Lesions or Aginations 

The methods of the present invention can also be used to detect genetic lesions or 
25 mutations in a CSG, thereby determining if a human with the genetic lesion is susceptible 
to developing colon cancer or to determine what genetic lesions are responsible, or are 
partly responsible, for a person's existing colon cancer. Genetic lesions can be detected, 
for example, by ascertaining the existence of a deletion, insertion and/or substitution of ■ 
one or more nucleotides from the CSGs of this invention, a chromosomal rearrangement 
30 of a CSG, an aberrant modification of a CSG (such as of the methylation pattern of the 
genomic DNA), or allelic loss of a CSG. Methods to detect such lesions in the CSG of 



wo 2004/050860 



PCT/US2003/040063 



114 

this invention are known to those having ordinary skill in the art following the teachings 
of the specification. 

Methods of Detecting Noncancerous Colon Diseases 

The present invention also provides methods for determining the expression levels 
5 and/or structural alterations of one or more CSNAs and/or CSPs in a sample from a patient 
suspected of having or known to have a noncancerous colon disease. In general, the 
method comprises the steps of obtaining a sample from the patient, determining the 
expression level or structural alterations of a CSNA and/or CSP, comparing the expression 
level or structural alteration of the CSNA or CSP to a normal colon control, and then 

10 ascertaining whether the patient has a noncancerous colon disease. In general, if high 
expression relative to a control of a CSNA or CSP is indicative of a particular 
noncancerous colon disease, a diagnostic assay is considered positive if the level of 
expression of the CSNA or CSP is at least two times higher, and more preferably are at 
least five times higher, even more preferably at least ten times higher, than in preferably 

15 the same cells, tissues or bodily fluid of a normal human control. In contrast, if low 
expression relative to a control of a CSNA or CSP is indicative of a noncancerous colon 
disease, a diagnostic assay is considered positive if the level of expression of the CSNA or 
CSP is at least two times lower, more preferably are at least five times lower, even more 
preferably at least ten times lower than in preferably the same cells, tissues or bodily fluid 

20 of a normal human control. The normal human control may be from a different patient or 
from uninvolved tissue of the same patient. 

One having ordinary skill in the art may determine whether a CSNA and/or CSP is 
associated with a particular noncancerous colon disease by obtaining colon tissue from a 
patient having a noncancerous colon disease of interest and determining which CSNAs 

25 and/or CSPs are expressed in the tissue at either a higher or a lower level than in normal 
colon tissue. In another embodiment, one may determine whether a CSNA or CSP 
exhibits structural alterations in a particular noncancerous colon disease state by obtaining 
colon tissue from a patient having a noncancerous colon disease of interest and 
determining the structural alterations in one or more CSNAs and/or CSPs relative to 

30 normal colon tissue. 
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Methods for IdeBtifving Colon Tissue 

In another aspect, the invention provides methods for identifying colon tissue. 
These methods are particularly useful in, e.g., forensic science, colon cell differentiation 
and development, and in tissue engineering. 

5 In one embodiment, the invention provides a method for determining whether a 

sample is colon tissue or has colon tissue-like characteristics. The method comprises the 
steps of providing a sample suspected of comprising colon tissue or having colon tissue- 
like characteristics, determining whether the sample expresses one or more CSNAs and/or 
CSPs, and, if the sample expresses one or more CSNAs and/or CSPs, concluding that the 

10 sample comprises colon tissue. In a preferred embodiment, the CSNA encodes a 
polypeptide having an amino acid sequence selected from SEQ ID NO: 96-237, or a 
homolog, allelic variant or fragment thereof In a more preferred embodiment, the CSNA 
has a nucleotide sequence selected from SEQ ID NO: 1-95, or a hybridizing nucleic acid, 
an allelic variant or a part thereof Determining whether a sample expresses a CSNA can 

15 be accomplished by any method known in the art. Preferred methods include 

hybridization to microarrays, Northern blot hybridization, and quantitative or qualitative 
RT-PCR. In another preferred embodiment, the method can be practiced by determining 
whether a CSP is expressed. Determining whether a sample expresses a CSP can be 
accomplished by any method known in the art. Preferred methods include Western blot, 

20 ELISA, RIA and 2D PAGE. In one embodiment, the CSP has an amino acid sequence 
selected from SEQ ID NO: 96-237, or a homolog, allelic variant or fragment thereof. In 
another preferred embodiment, the expression of at least two CSNAs and/or CSPs is 
determined. In a more preferred embodiment, the expression of at least three, more 
preferably four and even more preferably five CSNAs and/or CSPs are determined. 

25 In one embodiment, the method can be used to determine whether an unknown 

tissue is colon tissue. This is particularly useful in forensic science, in which small, 
damaged pieces of tissues that are not identifiable by microscopic or other means are 
recovered from a crime or accident scene. In another embodiment, the method can be 
used to determine whether a tissue is differentiating or developing into colon tissue. This 

30 is important in monitoring the effects of the addition of various agents to cell or tissue 
culture, e.g., in producing new colon tissue by tissue engineering. These agents include, 
e.g., growth and differentiation factors, extracellular matrix proteins and culture medium. 
Other factors that may be measured for effects on tissue development and differentiation 
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include gene transfer into the cells or tissues, alterations in pH, aqueousrair interface and 
various other culture conditions. 

Methods for Producing and Modifying Colon Tissue 

In another aspect, the invention provides methods for producing engineered colon 
5 tissue or cells. In one embodiment, the method comprises the steps of providing cells, 
introducing a CSNA or a CSG into the cells, and growing the cells under conditions in 
which they exhibit one or more properties of colon tissue cells. In a preferred 
embodiment, the cells are pleuripotent. As is well known in the art, normal colon tissue 
comprises a large number of different cell types. Thus, in one embodiment, the 

1 0 engineered colon tissue or cells comprises one of these cell types. In another embodiment, 
the engineered colon tissue or cells comprises more than one colon cell type. Further, the 
culture conditions of the cells or tissue may require manipulation in order to achieve full 
differentiation and development of the colon cell tissue. Methods for manipulating culture 
conditions are well known in the art. 

15 Nucleic acid molecules encoding one or more CSPs.are introduced into cells, 

preferably pleuripotent cells. In a preferred embodiment, the nucleic acid molecules 
encode CSPs having amino acid sequences selected from SEQ ID NO: 96-237, or 
homologous proteins, analogs, allelic variants or fragments thereof In a more preferred 
embodiment, the nucleic acid molecules have a nucleotide sequence selected from SEQ ID 

20 NO: 1-95, or hybridizing nucleic acids, allelic variants or parts thereof In another highly 
preferred embodiment, a CSG is introduced into the cells. Expression vectors and 
methods of introducing nucleic acid molecules into cells are well known in the art and are 
described in detail, supra. 

ArtijBcial colon tissue may be used to treat patients who have lost some or all of 

25 their colon function. 

Pharmaceutical Compositions 

In another aspect, die invention provides pharmaceutical compositions comprising 
the nucleic acid molecules, polypeptides, fusion proteins, antibodies, antibody derivatives, 
antibody fragments, agonists, antagonists, or inhibitors of the present invention. In a 
30 preferred embodiment, the pharmaceutical composition comprises a CSNA or part thereof 
In a more preferred embodiment, the CSNA has a nucleotide sequence selected from the 
group consisting of SEQ ID NO: 1-95, a nucleic acid that hybridizes thereto, an allelic 
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variant thereof, or a nucleic acid that has substantial sequence identity thereto. In another 
preferred embodiment, the pharmaceutical composition comprises a CSP or fragment 
thereof. In a more preferred embodiment^ the pharmaceutical composition comprises a 
CSP having an amino acid sequence that is selected from the group consisting of SEQ ID 
5 NO: 96-237, a pol)peptide that is homologous thereto, a fusion protein comprising all or a 
portion of the polypeptide, or an analog or derivative thereof. In another preferred 
embodiment, the pharmaceutical composition comprises an anti-CSP antibody, preferably 
an antibody that specifically binds to a CSP having an amino acid that is selected from the 
group consisting of SEQ ID NO: 96-237, or an antibody that binds to a polypeptide that is 

1 0 homologous thereto, a fiision protein comprising all or a portion of the polypeptide, or an 
analog or derivative thereof. 

Due to the association of angiogenesis with cancer vascularization there is great 
need of nev^ markers and methods for diagnosing angiogenesis activity to identify 
developing tumors and angiogenesis related diseases. Furthermore, great need is also 

1 5 present for new molecular targets useful in the treatment of angiogenesis and angiogenesis 
related diseases such as cancer. In addition known modulators of angiogenesis such as 
endostatin or vascular endothelial growth factor (VEGF). Use of the methods and 
compositions disclosed herein in combination with anti-angiogenesis drugs, drugs that 
block the matrix breakdown (such as BMS-275291, Dalteparin (Fragmin®), Suramin), 

20 drugs that inhibit endothelial cells (2-methoxyestradiol (2-ME), CC-5013 (Thalidomide 
Analog), Combretastatin A4 Phosphate, LY317615 (Protein Kinase C Beta Inhibitor), Soy 
Isoflavone (Genistein; Soy Protein Isolate), Thalidomide), drugs that block activators of 
angiogenesis (AE-941 (Neovastat™; GW786034), Anti-VEGF Antibody (Bevacizumab; 
Avastin™), Interferon-alpha, PTK787/ZK 222584, VEGF-Trap, ZD6474), Drugs that 

25 inhibit endothelial-specific integrin/survival signaling (EMD 121974, Anti-Anb3 Integrin 
Antibody (Medi-522; Vitaxin™)). 

Such a composition typically contains from about 0.1 to 90% by weight of a 
therapeutic agent of the invention formulated in and/or with a phannaceutically acceptable 
carrier or excipient. 

30 Pharmaceutical formulation is a well-established art that is further described in 

Gennaro (ed.). Remington: The Science and Practice of Pharmacy. 20* ed., Lippincott, 
Williams &"Wilkins (2000); Ansel et al, Pharmaceutical Dosage Forms and Drug 
Delivery Systems . 7* ed., Lippincott Williams & Wilkins (1999); and Kibbe (ed.), 
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Handbook of Pharmaceutical Excipients American Pharmaceutical Association, 3"*^ ed. 
(2000) and thus need not be described in detail herein. 

Briefly, formulation of the pharmaceutical compositions of the present invention 
will depend upon the route chosen for administration. The pharmaceutical compositions 
5 utilized in this invention can be administered by various routes including both enteral and 
parenteral routes, including oral, intravenous, intramuscular, subcutaneous, inhalation, 
topical, sublingual, rectal, intra-arterial, intramedullary, intrathecal, intraventricular, 
transmucosal, transdermal, intranasal, intraperitoneal, intrapulmonary, and intrauterine. 
Oral dosage forms can be formulated as tablets, pills, dragees, capsules, liquids, 
10 gels, syrups, slurries, suspensions, and the like, for ingestion by the patient. 

Solid formulations of the compositions for oral administration can contain suitable 
carriers or excipients, such as carbohydrate or protein fillers, such as sugars, including 
lactose, sucrose, mannitol, or sorbitol; starch from com, wheat, rice, potato, or other 
plants; cellulose, such as methyl cellulose, hydroxypropybnethyl-cellulose, sodium 
15 carboxymethylcellulose, or microcrystalline cellulose; gums including arabic and 
tragacanth; proteins such as gelatin and collagen; inorganics, such as kaolin, calcium 
carbonate, dicalcium phosphate, sodium chloride; and other agents such as acacia and 
alginic acid. 

Agents that facilitate disintegration and/or solubilization can be added, such as the 
cross-linked polyvinyl p>Trolidone, agar, alginic acid, or a salt thereof, such as sodium 
alginate, microcrystalline cellulose, cornstarch, sodium starch glycolate, and alginic acid. 

Tablet binders that can be used include acacia, methylcellulose, sodium 
carboxymethylcellulose, polyvinylpyrrolidone (Povidone™), hydroxypropyl 
methylcellulose, sucrose, starch and ethylcellulose. 

Lubricants that can be used include magnesium stearates, stearic acid, silicone 
fluid, talc, waxes, oils, and colloidal silica. 

Fillers, agents that facilitate disintegration and/or solubilization, tablet binders and 
lubricants, including the aforementioned, can be used singly or in combination. 

Solid oral dosage forms need not be uniform throughout. For example, dragee 
cores can be used in conjunction with suitable coatings, such as concentrated sugar 
solutions, which can also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, -and/or titanium dioxide, lacquer solutions, and suitable organic 
solvents or solvent mixtures. 
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Oral dosage forms of the present invention include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or 
sorbitol. Push-fit capsules can contain active ingredients mixed with a filler or binders, 
such as lactose or starches, lubricants, such as talc or magnesium stearate, and, optionally, 
5 stabilizers. In soft capsules, the active compounds can be dissolved or suspended in 
suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with or without 
stabilizers. 

Additionally, dyestuffs or pigments can be added to the tablets or dragee coatings 
for product identification or to characterize the quantity of active compound, i.e., dosage. 

10 Liquid formulations of the phannaceutical compositions for oral (enteral) 

administration are prepared in water or other aqueous vehicles and can contain various 
suspending agents such as methylcellulose, alginates, tragacanth, pectin, kelgin, 
carrageenan, acacia, polyvinylpyrrolidone, and polyvinyl alcohol. The liquid formulations 
can also include solutions, emulsions, syrups and elixirs containing, together with the 

15 active compound(s), wetting agents, sweeteners, and coloring and flavoring agents. 

The pharmaceutical compositions of the present invention can also be formulated 
for parenteral administration. Formulations for parenteral administration can be in the 
form of aqueous or non-aqueous isotonic sterile injection solutions or suspensions. 

For intravenous injection, water soluble versions of the compounds of the present 

20 invention are formulated in, or if provided as a lyophilate, mixed with, a physiologically 
acceptable fluid vehicle, such as 5% dextrose ("D5"), physiologically buffered saline, 
0.9% saline, Hanks' solution, or Ringer's solution. Intravenous formulations may include 
carriers, excipients or stabilizers including, without limitation, calcium, human serum 
albumin, citrate, acetate, calcium chloride, carbonate, and other salts. 

25 Intramuscular preparations, e,g. a sterile formulation of a suitable soluble salt form 

of the compounds of the present invention, can be dissolved and administered in a 
pharmaceutical excipient such as Water-for-Injection, 0.9% saline, or 5% glucose solution. 
Alternatively, a suitable insoluble form of the compound can be prepared and 
administered as a suspension in an aqueous base or a pharmaceutically acceptable oil base, 

30 such as an ester of a long chain fatty acid (e.g., ethyl oleate), fatty oils such as sesame oil, 
triglycerides, or liposomes. 

Parenteral fonnulations of the compositions can contain various carriers such as 
vegetable oils, dimethylacetamide, dimethylformamide, ethyl lactate, ethyl carbonate, 
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isopropyl myristate, ethanol, polyols (glycerol, propylene glycol, liquid polyethylene 
glycol, and the like). 

Aqueous injection suspensions can also contain substances that increase the 
viscosity of the suspension, such as sodium carboxyniethyl cellulose, sorbitol, or dextran. 
5 Non-lipid polycationic amino polymers can also be used for delivery. Optionally, the 
suspension can also contain suitable stabilizers or agents that increase the solubility of the 
compounds to allow for the preparation of highly concentrated solutions. 

Pharmaceutical compositions of the present invention can also be formulated to 
permit injectable, long-term, deposition. Injectable depot forms may be made by forming 

10 microencapsulated matrices of the compound in biodegradable polymers such as 

polylactide-polyglycolide. Depending upon the ratio of drug to polymer and the nature of 
the particular polymer employed, the rate of drug release can be controlled. Examples of 
other biodegradable polymers include poly(orthoesters) and poly(anhydrides). Depot 
injectable formulations are also prepared by entrapping the drug in microemulsions that 

1 5 are compatible with body tissues. 

The pharmaceutical compositions of the present invention can be administered 
topically. For topical use the compounds of the present invention can also be prepared in 
suitable forms to be applied to the skin, or mucus membranes of the nose and throat, and 
can take the form of lotions, creams, ointments, liquid sprays or inhalants, drops, tinctures, 

20 lozenges, or throat paints. Such topical formulations further can include chemical 
compounds such as dimethylsulfoxide (DMSO) to facilitate surface penetration of the 
active ingredient. In other transdermal formulations, typically in patch-delivered 
formulations, the pharmaceutically active compound is formulated with one or more skin 
penetrants, such as 2-N-methyl-pyrrolidone (NMP) or Azone. A topical semi-solid 

25 ointment formulation typically contains a concentration of the active ingredient from 
about 1 to 20%, e.g., 5 to 10%, in a carrier such as a pharmaceutical cream base. 

For application to the eyes or ears, the compounds of the present invention can be 
presented in liquid or semi-liquid form formulated in hydrophobic or hydrophilic bases as 
ointments, creams, lotions, paints or powders. 

30 For rectal administration the compounds of the present invention can be 

administered in the form of suppositories admixed with conventional carriers such as 
cocoa butter, wax or other glyceride. 
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Inhalation formulations can also readily be formulated. For inhalation, various 
powder and liquid formulations can be prepared. For aerosol preparations, a sterile 
formulation of the compound or salt form of the compound may be used in inhalers, such 
as metered dose inhalers, and nebulizers. Aerosolized forms may be especially useful for 
5 treating respiratory disorders. 

Alternatively, the compounds of the present invention can be in powder form for 
reconstitution in the appropriate pharmaceutically acceptable carrier at the time of 
delivery. 

The pharmaceutically active compound in the pharmaceutical compositions of the 
10 present invention can be provided as the salt of a variety of acids, including but not limited 
to hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and succinic acid. Salts tend to be 
more soluble in aqueous or other protonic solvents than are the corresponding free base 
forms. 

After pharmaceutical compositions have been prepared, they are packaged in an 
1 5 appropriate container and labeled for treatment of an indicated condition. 

The active compound will be present in an amount effective to achieve the 
intended purpose. The determination of an effective dose is well within the capability of 
those skilled in the art. 

A "therapeutically effective dose" refers to that amount of active ingredient, for 
20 example CSP polypeptide, fusion protein, or fragments thereof, antibodies specific for 
CSP, agonists, antagonists or inhibitors of CSP, which ameliorates the signs or symptoms 
of the disease or prevent progression thereof; as would be understood in the medical arts, 
cure, although desired, is not required. 

The therapeutically effective dose of the pharmaceutical agents of the present 
25 invention can be estimated initially by m vitro tests, such as cell culture assays, followed 
by assay in model animals, usually mice, rats, rabbits, dogs, or pigs. The animal model 
can also be used to determine an initial preferred concentration range and route of 
administration. 

For example, the ED50 (the dose therapeutically effective in 50% of the 
30 population) and LD50 (the dose lethal to 50% of the population) can be determined in one 
or more cell culture of animal model systems. The dose ratio of toxic to therapeutic 
effects is the therapeutic index, which can be expressed as LD5O/ED50. Pharmaceutical 
compositions that exhibit large therapeutic indices are preferred. 
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The data obtained from cell culture assays and animal studies are used in 
fonnulating an initial dosage range for human use, and preferably provide a range of 
circulating concentrations that includes the ED50 with little or no toxicity. After 
administration, or between successive administrations, the circulating concentration of 
5 active agent varies within this range depending upon pharmacokinetic factors well known 
in the art, such as the dosage form employed, sensitivity of the patient, and the route of 
administration. 

The exact dosage will be determined by the practitioner, in light of factors specific 
to the subject requiring treatment. Factors that can be taken into account by the 

10 practitioner include the severity of the disease state, general health of the subject, age, 
weight, gender of the subject, diet, time and frequency of administration, drug 
combination(s), reaction sensitivities, and tolerance/response to therapy. Long-acting 
pharmaceutical compositions can be administered every 3 to 4 days, every week, or once 
every two weeks depending on half-life and clearance rate of the particular formulation. 

15 Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to a total 

dose of about 1 g, depending upon the route of administration. Where the therapeutic 
agent is a protein or antibody of the present invention, the therapeutic protein or antibody 
agent typically is administered at a daily dosage of 0.01 mg to 30 mg/kg of body weight of 
the patient (e.g., Img/kg to 5 mg/kg). The pharmaceutical formulation can be 

20 administered in multiple doses per day, if desired, to achieve the total desired daily dose. 
Guidance as to particular dosages and methods of delivery is provided in the 
literature and generally available to practitioners in the art. Those skilled in the art will 
employ different fomiulations for nucleotides than for proteins or their inhibitors. 
Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 

25 conditions, locations, etc. 

Conventional methods, known to those of ordinary skill in the art of medicine, can 
be used to administer flie pharmaceutical formulation(s) of the present invention to the 
patient. The pharmaceutical compositions of the present invention can be administered 
alone, or in combination wifli other therapeutic agents or interventions. 



30 



Therapeutic Methods 

The present invention fiirther provides methods of treating subjects having defects 
in a gene of the invention, e,g,, in expression, activity, distribution, localization, and/or 
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solubility, which can manifest as a disorder of colon function. As used herein, "treating" 
includes all medically-acceptable types of therapeutic intervention, including palliation 
and prophylaxis (prevention) of disease. The term "treating" encompasses any 
improvement of a disease, including minor improvements. These methods are discussed 
5 below. 

Gene Tlierapy and Vaccines 

The isolated nucleic acids of the present invention can also be used to drive in vivo 
expression of the polypeptides of the present invention. In vivo expression can be driven 
from a vector, typically a viral vector, often a vector based upon a replication incompetent 

10 retrovirus, an adeno\arus, or an adeno-associated virus (AAV), for the purpose of gene 
therapy. In vivo expression can also be driven from signals endogenous to the nucleic acid 
or from a vector, often a plasmid vector, such as pVAXl (Invitrogen, Carlsbad, CA, 
USA), for purpose of "naked" nucleic acid vaccination, as further described in U.S. Patent 
Nos. 5,589,466; 5,679,647; 5,804,566; 5,830,877; 5,843,913; 5,880,104; 5,958,891; 

15 5,985,847; 6,017,897; 6,1 10,898; 6,204,250, the disclosures of which are incorporated 
herein by reference in their entireties. For cancer therapy, it is preferred that the vector 
also be tumor-selective. See, e,g„ Doronin et al, J, Virol 75: 3314-24 (2001). 

In another embodiment of the therapeutic methods of the present invention, a 
therapeutically effective amount of a pharmaceutical composition comprising a nucleic 

20 acid molecule of the present invention is administered. The nucleic acid molecule can be 
delivered in a vector that drives expression of a CSP, fusion protein, or fragment thereof, 
or without such vector. Nucleic acid compositions that can drive expression of a CSP are 
administered, for example, to complement a deficiency in the native CSP, or as DNA 
vaccines. Expression vectors derived from virus, replication deficient retroviruses, 

25 adenovirus, adeno-associated (AAV) virus, herpes virus, or vaccinia virus can be used as 
can plasmids. See, e.g., Cid-Airegui, supra. In a preferred embodiment, the nucleic acid 
molecule encodes a CSP having the amino acid sequence of SEQ ID NO: 96-237, or a 
fragment, fiision protein, allelic variant or homolog thereof. 

In still other therapeutic methods of the present invention, pharmaceutical 

30 compositions comprising host cells that express a CSP, fusions, or fragments thereof can 
be administered. In such cases, the cells are typically autologous, so as to circumvent 
xenogeneic or allotypic rejection, and are administered* to complement defects in CSP 
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production or activity. In a preferred embodiment, the nucleic acid molecules in the cells 
encode a CSP having the amino acid sequence of SEQ ID NO: 96-237, or a fragment, 
fusion protein, allelic variant or homolog thereof. 

Antisense Administration 
5 Antisense nucleic acid compositions, or vectors that drive expression of a CSG 

antisense nucleic acid, are administered to downregulate transcription and/or translation of 
a CSG in circumstances in which excessive production, or production of aberrant protein, 
is the pathophysiologic basis of disease. 

Antisense compositions useful in therapy can have a sequence that is 

1 0 complementary to coding or to noncoding regions of a CSG. For example, 

oligonucleotides derived from the transcription initiation site, e.g., between positions -10 
and +10 from the start site, are preferred. 

Catalytic antisense compositions, such as ribo2ymes, that are capable of 
sequence-specific hybridization to CSG transcripts, are also useful in therapy. See, eg., 

15 Phylactou, Adv. DnigDeliv. Rev, 44(2-3): 97-108 (2000); Phylactou et al, Hum, Mol 
Genet. 7(10): 1649-53 (1998); Rossi, Ciba Found Symp. 209: 195-204 (1997); and 
Sigurdsson et al. Trends Biotechnol 13(8): 286-9 (1995). 

Other nucleic acids useful in the therapeutic methods of the present invention are 
those that are capable of triplex helix formation in or near the CSG genomic locus. Such 

20 triplexing oligonucleotides are able to inhibit transcription. See» e.g., Intody et aL, Nucleic 
Acids Res. 28(21): 4283-90 (2000); and McGuffie et ai, Cancer Res. 60(14): 3790-9 
(2000). Pharmaceutical compositions comprising such triplex forming oligos (TFOs) are 
administered in circumstances in which excessive production, or production of aberrant 
protein, is a pathophysiologic basis of disease. 

25 In a preferred embodiment, the antisense molecule is derived from a nucleic acid 

molecule encoding a CSP, preferably a CSP comprising an amino acid sequence of SEQ 
ID NO: 96-237, or a fragment, allelic variant or homolog thereof. In a more preferred 
embodiment, the antisense molecule is derived from a nucleic acid molecule having a 
nucleotide sequence of SEQ ID NO: 1-95, or a part, allelic variant, substantially similar or 

30 hybridizing nucleic acid thereof. 
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In one embodiment of the therapeutic methods of the present invention, a 
therapeutically effective amount of a pharmaceutical composition comprising a CSP, a 
fusion protein, fragment, analog or derivative thereof is administered to a subject with a 
clinically-significant CSP defect. 
5 Protein compositions are administered, for example, to complement a deficiency in 

native CSP. In other embodiments, protein compositions are administered as a vaccine to 
elicit a humoral and/or cellular immune response to CSP. The immune response can be 
used to modulate activity of CSP Oj^ depending on the immunogen, to immunize against 
aberrant or aberrantly expressed forms, such as mutant or inappropriately expressed 

10 isoforms. In yet other embodiments, protein fusions having a toxic moiety are 
administered to ablate cells that aberrantly accumulate CSP. 

In a preferred embodiment, the polypeptide administered is a CSP comprising an 
amino acid sequence of SEQ ID NO: 96-237, or a fusion protein, allelic variant, homolog, 
analog or derivative thereof. In a more preferred embodiment, the polypeptide is encoded 

15 by a nucleic acid molecule having a nucleotide sequence of SEQ ID NO: 1-95, or a part, 
allelic variant, substantially similar or hybridizing nucleic acid thereof. 

Antibody, Agonist and Antagonist Administration 

In another embodiment of the therapeutic methods of the present invention, a 
therapeutically effective amount of a pharmaceutical composition comprising an antibody 

20 (including fragment or derivative thereof) of the present invention is administered. As is 
well known, antibody compositions are administered, for example, to antagonize activity 
of CSP, or to target therapeutic agents to sites of CSP presence and/or accumulation. In a 
preferred embodiment, the antibody specifically binds to a CSP comprising an amino acid 
sequence of SEQ ID NO: 96-237, or a fusion protein, allelic variant, homolog, analog or 

25 derivative thereof. In a more preferred embodiment, the antibody specifically binds to a 
CSP encoded by a nucleic acid molecule having a nucleotide sequence of SEQ ID NO: 1- 
95, or a part, allelic variant, substantially similar or hybridizing nucleic acid thereof. 

The present invention also provides methods for identifying modulators which 
bind to a CSP or have a modulatory effect on the expression or activity of a CSP. 

30 Modulators which decrease the expression or activity of CSP (antagonists) are believed to 
be useful in treating colon cancer. Such screening assays are known to those of skill in the 
art and' include, without limitation, cell-based assays and cell-free assays. Small molecules 
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predicted via computer imaging to specifically bind to regions of a CSP can also be 
designed, synthesized and tested for use in the imaging and treatment of colon cancer. . 
Further, libraries of molecules can be screened for potential anticancer agents by assessing 
the ability of the molecule to bind to the CSPs identified herein. Molecules identified in 
5 the library as being capable of binding to a CSP are key candidates for further evaluation 
for use in the treatment of colon cancer. In a preferred embodiment, these molecules will 
downregulate expression and/or activity of a CSP in cells. 

In another embodiment of the therapeutic methods of the present invention, a 
pharmaceutical composition comprising a non-antibody antagonist of CSP is administered. 
10 Antagonists of CSP can be produced using methods generally known in the art. In 
particular, purified CSP can be used to screen libraries of pharmaceutical agents, often 
combinatorial libraries of small molecules, to identify those that specifically bind and 
antagonize at least one activity of a CSP. 

In other embodiments a pharmaceutical composition comprising an agonist of a 
1 5 CSP is administered. Agonists can be identified using methods analogous to those used to 
identify antagonists. 

In a preferred embodiment, the antagonist or agonist specifically binds to and 
antagonizes or agonizes, respectively, a CSP comprising an amino acid sequence of SEQ 
ID NO: 96-237, or a fiision protein, allelic variant, homolog, analog or derivative thereof. 
20 In a more preferred embodiment, the antagonist or agonist specifically binds to and 

antagonizes or agonizes, respectively, a CSP encoded by a nucleic acid molecule having a 
nucleotide sequence of SEQ ID NO: 1-95, or a part, allelic variant, substantially similar or 
hybridizing nucleic acid thereof. 

25 Targeting Colon Tissue 

The invention also provides a method in which a polypeptide of the invention, or 
an antibody thereto, is linked to a therapeutic agent such that it can be delivered to the 
colon or to specific cells in the colon. In a preferred embodiment, an anti-CSP antibody is 
linked to a therapeutic agent and is administered to a patient in need of such therapeutic 

30 agent The therapeutic agent may be a toxin, if colon tissue needs to be selectively 

destroyed. This would be useful for targeting and killing colon cancer cells. In another 
embodiment, the- therapeutic agent may be a growth or differentiation factor, which would 
be useful for promoting colon cell function. 
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In another embodiment, an anti-CSP antibody may be linked to an imaging agent 
that can be detected using, e.g., magnetic resonance imaging, CT or PET. This would be 
useful for determining and monitoring colon function, identifying colon cancer tumors, 
and identifying noncancerous colon diseases. 

5 EXAMPLES 

Example la: Alternative Splice Variants 

We identified gene transcripts using the Gencarta™ tools (Compugen Ltd., Tel 
Aviv, Israel) and a variety of public and proprietary databases. These splice variants are 
either sequences which differ from a previously defined sequence or new uses of known 

10 sequences. In general related variants are annotated as DEX0448_XXX.nt.l, 

DEX0448_XXX.nt.2, DEX0448_XXX.nt.3, etc. The variant DNA sequences encode 
proteins which differ from a previously defined protein sequence. In relation to the 
nucleotide sequence naming convention, protein variants are annotated as 
DEX0448_XXX.aa.l, DEX0448_XXX.aa.2, etc., wherein transcript DEX0448_XXX.nt.l 

1 5 encodes protein DEX0448_XXX.aa. 1 . A single transcript may encode a protein from an 
alternate Open Reading Fram (ORF) which is designated DEX0448_XXX.orf.l . 
Additionally, multiple transcripts may encode for a single protein. In this case, 
DEX0448_XXX.nt.I and DEX0448_XXX.nt.2 will both be associated with 
DEX0448_XXX.aa.l. 

20 The mapping of the nucleic acid C^NT') SEQ ID NO; DEX ID; chromosomal 

location (if known); open reading frame (ORF) location; amino acid ("AA") SEQ ID NO; 
AA DEX ID; are shown in the table below. 



SEQ 

ID 

NO 


DEX ID 


Chrome Map 


ORF Loc 


SEQ 
ID 

NO 


DEX ID 


1 


DEX0448 OOl.nt.l 


6pl2.2 


1-547 


96 


DEX0448 OOl.aa.l 


1 


DEX044a OOl.nt.l 


6pl2.2 


212-538 


97 


DEX0448 OOl.orf.l 


2 


DEX0448_002.nt.l 


X;22899334- 
22902697 


1-323 


98 


DEX044 8_002.aa.l 


2 


DEX044 8_002.nt.l 


X;22899334- 
22902697 


66-310 


99 


DEX0448_002.orf .1 


3 


DEX0448 003.nt.l 


8p23 . 1 


250-924 


100 


DEX0448 003.aa.l 


3 


DEX0448 003.nt.l 


8p23 .1 


3-545 


101 


DEX0448 003.orf.l 




DEX0448 004.nt.l 


19ql3.2 


1-686 


102 


DEX0448 004.aa.l 




DEX0448 004.nt.l 


19ql3.2 


1-666 


103 


DEX0448 O04.orf.l 


5 1 


DEX0448 OOS.nt.l 


7q32.3 


206-431 


104 


DEX0448 OOS.aa.l 
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5 


DEX0448 OOR nt 1 


|7g32^3 


1/3 -427 


105 


|dEX044 8_005 . orf . 1 


5 




1 /q^z . ,3 


2Ub-431 


104 


|pEX0448 OOS.aa.l 






1*7 /tIO 1 


173-427 


106 


[dEX0448 005. orf. 2 


7 


uc»A \Jt*t a vjuo.riL.J. 


It o o 


1-200 


107 


[pEX0448 OOe.aa.l 


7 


npvn4 4fi nnc nt- t 
uc«Au^^o uuo-nc.j. 


jiqj . A 


3-200 


108 


|DEX0448 006, orf. 1 


g 






61-443 


109 


^EX0448_007 .aa . 1 


Q 




fxqzz 


3-353 


110 


[DEX0448_007 . orf . 1 


9 




|ZXq22 . 2 


172-676 


111 


|dEX044 8_008 . aa . 1 


9 




|zxqzz . z 


2-676 


112 


|dEX0448 008 . Orf . 1 


10 


DEX044a 00 Q n^ 1 


ll 7rrO I 1 

|x /qz J . J 


3-527 


113 


|DEX0448_009 . aa . 1 


11 


DEX0448 010 nh 1 


pqjz 


60-869 


114 


jDEA044 8 010 . aa . 1 


12 


nEX044fl Oil n*- 1 


|x /qz 1 . 


1-673 


115 


|dEX0448 Oll.aa.l 


12 


DF!X044ft mi ■n^ 1 


|i /qzi . 32 


311-673 


116 


|DEX044 8 oil .orf . 1 


13 




It *nt t o 
|lpl3 .3 


6-207 


117 


|dEX044 8_012 - aa . 1 


1 7 

X J 




[IplJ . 3 


18-260 


118 


[dEX044 8_012 .orf . 1 


14 




It T i-rl A 

|Xiqz4 . 2 


157-911 


119 


|DEX0448_013 .aa . 1 


14 




It t /I T 


279-887 


120 


|dEX0448 013. orf. 1 


15 


npy044P 014 n*- 1 


It t i-¥T *a A 
jxxqx3 * *± 


49-431 


121 


|dEX0448_014 .aa. 1 


15 




It 1 «T ^ A 
|Xxqi3 • 4 


3-431 


122 


|dEX0448 014 .orf . 1 


16 


r)l?X044R 01 ^ ni- 1 


ll 1 rt'\ *a ^ 

ixxqx3 . 4 


1-230 


123 


DEX0448 015.aa,l 


16 




1 1 o1 1 4 
X xqxo . *i 


"5 C C 1 

^ -DbX 


124 


TM?VA/t A O n r" JZ •» 

DEAU44 8 OlS.orr.l 


17 


DEX0448 016 n^ 1 


1 ^rr'^e; "i 
XDq^3 . ^ 


A cr Q C 

-5 - o y D 


125 


DEA0448 Olb.aa.l 


18 


nKX04 4fl 01 nt- 0 


x^qz^ . 3 


20y- o33 


126 


DEa0448 016. aa. 2 


18 


nKy044fl 01^^ nh O 


X Dq^3 . 3 


O A O O O 


127 


DEX0448 016. orf. 2 


19 




X pq^^i? . 3 


54-503 


128 


DEX0448 016. aa. 3 


20 


nFy044R 01*7 rtl- 1 


oqz^ . 3 


1-698 


129 


DEX0448 017.aa.l 


20 


npy04 4P 01*7 1 


oq24 . 3 


OA O O O 

80-892 


130 


DEX044 8 O17.orf.l 


21 


DEX0448 OlB.nt.l 


6p21.1 


133-1745 


131 


DEX0448 OlS.aa.l 


21 


DEX0448 OlB.nt.l 


6p2l.l 


842-1411 


132 


DEX0448 018. orf. 1 


22 


DBX0448 OlS.nt.l 


16q23.1 


46-378 


133 


DEX0448 019.aa.l 


23 


DEX0448 020.nt.l 


3q27.2 


1-783 


134 


DEX0448 020.aa.l 


24 


DEX0448 021.nt.l 


llpl5.5 


353-1435 


135 


DEX0448 021.aa.l 


25 


DEX0448 021. nt. 2 


llplS.S 


353-1339 


136 


DEX0448 021. aa. 2 


26 


DEX0448 021, nt. 3 


llplS . 5 


1 -492 


117 
XJ / 


npy044R O01 aa 1 


27 


DEX0448_022.nt.l 


5 DR51 '3849138- 
3852035 


2-203 


138 


DEX0448_022.aa.l 


z / 


U£A04 4 D^02 2 . nt . 1 


6_DR51;3849138- 
3852035 


214-618 


139 


DEX0448_022,orf .1 


28 


DEX0448_022 . nt - 2 


6_DR51;3849138- 
3851915 


149-497 


14 0 




28 


DEX044 8_022.nt.2 


C TYO CT •'30>t01'aO 

o_^ukdx ;3o^yxjb- 
3851915 


109-513 


141 


DEX0448_022.orf .2 


29 


DEX0448 023.nt.l 


19ql3.2 


115-777 


142 


DEX0448 023.aa.l 


29 


t)EX0448 023.nt.l 


19ql3.2 


22-882 


143 


DEX0448 023. orf. 1 


30 


DEX0448 023. nt. 2 


19ql3.2 


387-831 


144 


DEX0448 023. aa. 2 


30 


DEX0448 023. nt. 2 


19ql3.2 


1337-1786 


145 


DEX044 8 023. orf, 2 


31 


DEX0448 023. nt. 3 


10q26 .3 


750-3186 


146| 


DEX0448 023. aa. 3 


31 


DEX0448 023. nt. 3 


10q26.3 


742-2067 


147| 


DEX0448 023. orf. 3 
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32 


DEX0448 024.nt.l 


19pl3.12 


37-1091 


148| 


DEX044 8 024.aa.l 


32 


DEX0448 024.nt.l 


19pl3.12 


1-669 


149| 


DEX044e 024.orf.l 


33 


DEX0448 025.nt.l 


llplS.S 


1-783 


150| 


DEX0448 025.aa.l 


33 


DEX0448 025.nt.l 


llpl5.5 


3-779 


15l| 


DEX0448 O25.orf.l 


34 


DEX0448 026.nt.l 


19ql3.2 


179-2128 


152| 


DEX0448 026.aa.l 


35 


DEX0448 026. nt. 2 


19ql3.2 


341-1504 


153| 


DEX0448 026. aa. 2 


36 


DEX0448 027.nt.l 


3p26.1 


2590-2842 


154 


DEX0448 027.aa.l 


36 


DEX0448 027.nt.l 


3p26.1 


3377-4513 


155 


DEX0448 027.orf.l 


37 


DEX0448 027. nt. 2 


3p26.1 


2590-2842 


154 


DEX044B 027.aa.l 


37 


DEX044B 027. nt. 2 


3p26.1 


3245-4381 


156 


DBX0448 027.orf.2 


38 


DEX0448 027. nt. 3 


3p26.1 


2590-3667 


157 


DEX0448 027. aa. 3 


38 


DEX0448 027. nt. 3 


3p26.1 


2677-3663 


158 


DEX0448 027.orf.3 


39 


DEX0448 027. nt. 4 


3p26.1 


2590-2842 


154J 


DEX 04 4 a_0 27.aa.l | 


39 


DEX0448 027. nt. 4 


3p26.1 


2970-4244 


159 


DEX044 8 027. or f. 4 


40 


DEX0448 027. nt. 5 


3p26.1 


2590-2842 


154| 


DEX0448 027.aa.l | 


40 


DEX0448 027. nt. 5 


3p26.1 


2970-4244 


160 


DEX0448 027.orf.S 


41 


DEX044 8 027. nt. 6 


3p26.1 


1-195 


16lt 


DEX0448 027. aa. 6 


41 


DEX0448 027. nt. 6 


3p26.1 


354-686 


162| 


DEX0448 027.orf.6 


42 


DEX0448 028.nt.l 


6p21.32 


436-874 


163 


DEX0448 028.aa.l 


42 


DEX0448 028.nt.l 


6D21.32 


61-870 


164 


DEX0448 028,orf.l 


43 


DEX0448 028. nt. 2 


6p21.32 


186-624 


163 


DEX044 8 028.aa.l 


43 


DEX0448 028. nt. 2 


6p21.32 


80-619 


165 


DEX0448 028.orf.2 


44 


DEX0448 029.nt.l 


19ql3.33 


101-2488 


166 


DEX0448 029.aa.l 


45 


DEX044 8 029.nt.2 


19ql3.33 


163-2043 


167 


DEX0448 029. aa. 2 


46 


DEX0448 029.nt-3 


19ql3.33 


166-2046 


168 


DEX044 8 029. aa. 3 


47 


DEX0448 029. nt. 4 


19ql3.33 


166-1614 


169 


DEX0448 029. aa. 4 


48 


DEX044 8 029. nt. 5 


19ql3.33 


38-1852 


170 


DEX0448 029. aa. 5 


49 


DEX0448 029. nt. 6 


19ql3,33 


38-1522 


171 


DEX0448 029. aa. 6 


50 


DEX044 8 029. nt. 7 


19ql3.33 


26-1633 


|l72 


DEX0448 029.orf.7 


50 


DEX0448 029. nt. 7 


19ql3.33 


1-1635 


|l73 


DEX0448 029. aa. 7 


51 


DEX0448 030.nt.l 


lq25.3 


2-1447 


|l74 


DEX0448 030.orf.l 




DEX0448 OBO.nt.l 


lq25.3 


91-1455 


|l75 


DEX0448 030.aa.l 


52 


DEX044 8 031.nt.l 


lq24.2 


324-1343 


|l75 


DEX0448 031.aa.l 


[53 


DEX0448 032.nt.l 


8q24.3 


46-957 


|l77 


DEX0448 O32.orf.l 


53 


DEX0448 032.nt.l 


8q24.3 


380-938 


178 


DEX0446 032.aa.l 


54 


DEX0448 033.nt.l 


18pll.32 


3-1454 


179 


DEX0448 O33.orf.l 


54 


DEX0448 033.nt.l 


18pll.32 


3-1455 


180 


DEX0448 033.aa.l 


55 


DEX0448 033. nt. 2 


18pll.32 


3-1454 


181 


|DEX0448_033.orf .2 


55 


DEX0448 033. nt. 2 


|l8pll.32 


3-1455 


180 


[DEX0448 033,aa.l 


56 


DEX0448 033. nt. 3 


|l8pll.32 


3-1454 


182 


|DEX0448_033.orf .3 


56 


DEX0448 033. nt. 3 


|l8pll.32 


3-1455 


|l80 


|dEX044 8_033 .aa.l 


57 


OEX0448 033. nt. 4 


|l8pll.32 


1199-1945 


|l83 


|dEX0448 033.orf.4 


57 


DEX0448 033. nt. 4 


ll8pll.32 


3-669 


|l84 


lDEX0448_033.aa.4 


58 


DEX0448 033. nt. 5 


|l8pll.32 


3-1250 


|l85 


|DEX044 8_033.orf .5 


58 


DEX0448 033. nt. 5 


|l8pll.32 


3-1251 


|l86 


|DEX0448_033.aa.5 


59 


DEX04 4 8 033. nt. 6 


|l8pll.32 


3-1454 


|l87 


[DEX0448_033.orf .6 




DEX044 8 033. nt. 6 


|l8pll.32 


3-1455 


|l8C 


|DEX0448_033.aa.l 
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60 


DEX0448 033. nt. 7 


18pll.32 1 


3-1049 


188 


DEX0448_033.orf .7 


60 


DEX044 8 033. nt. 7 


18pll.32 1 


134-1052 


189 


DEX0448 033. aa. 7 


61 


DEX0448 033. nt. 8 


18pll.32 


3-1454 


190 


DEX0448_033.orf .8 


61 


DEX044B 033.nt.8 


18pll.32 


3-1455 


180 


DEX0448_033.aa.l 


62 


DEX0448 033. nt. 9 


18pll.32 


3-1454 


191 


t)BX044 8_033.orf .9 


62 


DEX0448 033. nt. 9 


IBpll .32 


3-1455 


180 


DEX0448 033.aa.l 


63 


DEX0448 033.nt.lO 


18011.32 


3-1454 


192 


DEX0448 O33.orf.l0 


63 


DEX0448 033.nt.lO 


18pll.32 


3-1455 


180 


DEX0448 033.aa.l 


64 


DEX0448 034.nt.X 


15q25.3 


1-417 


193 


DEX0448 Q34.aa.l 


65 


DEX0448 035.nt.l 


lq42.3 


61-1218 


194 


DEX0448 035.aa.l 


66 


DEX0448 035. nt. 2 


lq42.3 


61-1221 


194 


DEX0448 035.aa.l 


67 

ei" 


DEX0448 035. nt. 3 


lq42.3 


61-1221 


194 


DEX0448 035.aa.l 


DEX0448 035. nt. 4 


lq42.3 


61-1536 


195 


DEX0448 035. aa. 4 


69 


DEX0448 036.nt.l 


4pl6 .3 


407-1480 


196 


DEX0448 036.aa.l 


70 


DEX0448 036. nt. 2 


4pl6.3 


167-1262 


197 


DEX0448 036. aa. 2 


70 


DEX0448 036. nt. 2 


4pl6.3 


466-1041 


198 


DEX0448 036.orf.2 


71 


DEX0448 036. nt. 3 


4pl5.3 


31-569 


199 


DEX0448 036. aa. 3 


DEX0448 036. nt. 3 


4pl6 .3 


76-561 


200 


DEX0448 036.orf.3 


DEX0448 037.nt.l 


llDlS.S 


3-818 


201 


DEX0448 037.aa.l 


73 1 


DEX0448 037. nt. 2 


llplS.5 


3-911 


202 


DEX0448 037. aa. 2 


74 ! 


DEX044e 037. nt. 3 


llpl5.5 


494-1757 


203 


DEX0448 037. aa. 3 


74 


DEX0448 037. nt. 3 


[llpl5.5 


1016-1756 


204 


DEX0448 037.orf.3 


75 


DEX0448 037. nt. 4 


[llpl5.5 


240-947 


205 


DEX0448 037.aa.4 


76 

rT 


DEX0448 037. nt. 5 


llpl5.5 


240-1007 


206 


DEX0448 037. aa. 5 


DEX0448 037. nt. 6 


llpl5.5 


240-869 


207 


DEX0448 037, aa. 6 


78 


DEX0448 038.nt.l 


lBpll.32 


1-179 


208 


DEX0448 038.aa.l 


78 


DEX0448 038.nt.l 


18pll.32 


22-294 


209 


DEX044 8 O3 8.orf.l 


79 


DEX0448 039.nt.l 


llqll 


620-877 


210 


DEX044 8 039.aa.l 


80 


DEX0448 040.nt.l 


16pl3.3 


1-325 


211 


DEX044 8 04 0.aa.l 


80 


DEX0448 040.nt.l 


16pl3.3 


52-321 


212 


DEX0448 O40.orf.l 


81 


DEX0448 040. nt. 2 


16pl3.3 


43-625 


213 


DEX0448 040. aa. 2 


|81 


DEX0448 040. nt. 2 


16pl3.3 


51-617 


214 


DEX0448 040.orf.2 




DEX044B 040.nt.3 


16pl3.3 


22-630 ^ 


215 


DEX0448 040. aa. 3 


82 


DEX0448 040. nt. 3 


16pl3.3 


|3-626 


216 


DEX0448 040.orf.3 


83 


DEX0448 040. nt. 4 


16pl3.3 


|l-567 


217 


DEX0448 040. aa. 4 


84 


DEX0448 040.nt.5 


16pl3.3 


|l-517 


218 


DEX044 8 04 0.aa.5 


84 


DEX0448 040. nt. 5 


16pl3.3 


|1-513 


219 


DEX0448 040.orf.5 


85 


DEX0448 040. nt. 6 


16pl3.3 


[1352-1823 


220 


DEX0448 040.aa.6 


85 


DEX0448 040. nt. 6 


16pl3 .3 


|1347-1814 


221 


DEX0448 040.orf.6 


86 


DEX0448 040.nt.7 


16pl3 .3 


134-265 


222 


DEX0448 040. aa. 7 


86 


DEX0448 040. nt. 7 


16pl3.3 


|504-920 


223 


DEX0448 040.orf.7 


87 


DEX0448 041.nt.l 


2q37.1 


|151-1854 


224 


DEX0448 041.aa.l 


88 


DEX0448 041. nt. 2 


2q37.1 


[151-1710 


225 


DEX0448 041. aa. 2 


89 


DEX0448 042.nt.l 


2q33.1 


1188-1765 


226 


DEX0448 042.aa.l 


|90 


DEX0448 043.nt.l 


15q22.31 


ll-366 


227 


DEX0448 043.aa.l 


|9 0 


DEX0448 043.nt.l 


15q22.31 


|l54-366 


228 


DEX0448 043.orf.J. 


|91 


DEX0448 043.nt.2 


15q22.31 


11-730 


229 


DEX0448 043. aa. 2 
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|91 ||DEX0448 043. nt. 2 


15q22.31 |24>1049 


[23 0 


|DEX044 8 043.orf.2 


|92 ||deX0448 044.nt:.l 


20al3 33 Ih-:?1Q 




PEA0448 044.aa.l 


[92 


|DEX04 4 8 044.nt.l 


20ql3.33 |48-356 


|2^32 


|DSX0448_044.orf .1 


l93 


IDEX0448 044. nt. 2 


20ql3.33 l|l-219 


|231 


DEX0448 044.aa.l 


|93 


IDEX0448 044. nt. 2 


20ql3.33 1148-383 


I233 


DEX0448 044.orf.2 


|94 


IDEX0448 044. nt. 3 


20ql3.33 |l-265 


|234 


DEX0448 044. aa. 3 


(94 


|DEX0448 044. nt. 3 


20ql3.33 |2-259 


235| 


DEX0448 044.orf.3 




DEX044 8 044. nt. 4 


20ql3.33 1131-265 


236| 


DEX0448 044. aa. 4 


|95 1 


DEX0448 044. nt. 4 


20ql3.33 113-260 


237 


DEX0448 044.orf.4 



The polypeptides of the present invention were analyzed and the following 
attributes were identified; specifically, epitopes, post translational modifications, signal 
5 peptides and transmembrane domains. Antigenicity (Epitope) prediction was performed 
through the antigenic module in the EMBOSS package. Rice, P., EMBOSS: The 
European Molecular Biology Open Software Suite, Tre?ids in Genetics 16(6): 276-277 
(2000). The antigenic module predicts potentially antigenic regions of a protein sequence, 
using the method of Kolaskar and Tongaonkar. Kolaskar, AS and Tongaonkar, PC, A 
1 0 semi-empirical method for prediction of antigenic determinants on protein antigens, FEBS 
Letters 276: 172-174 (1990). Examples of post-translational modifications (PTMs) and 
other motifs of the CSPs of this invention are listed below. In addition, antibodies that 
specifically bind such post-translational modifications maybe usefiil as a diagnostic or as 
therapeutic. The PTMs and other motifs were predicted by using the ProSite Dictionary of 

1 5 Proteins Sites and Patterns (Bairoch et al. Nucleic Acids Res. 25(1 ):2 1 7-22 1 (1 997)), the 
following motifs, including PTMs, were predicted for the CSPs of the invention. The 
signal peptides were detected by using the SignalP 2.0, see Nielsen et al. Protein 
Engineering 12, 3-9 (1999). Prediction of transmembrane helices in proteins was 
performed by the application TMHMM 2.0, "currently the best performing transmembrane 

20 prediction program", according to authors (Krogh et al , Journal of Molecular Biology, 
305(3):567-580, (2001); UoWtr et al.Bioinformatics, 17(7):646-653, (2001); 
Sonnhammer, et al.,A hidden Markov model for predicting transmembrane helices in 
protein sequences in Glasgow, et al Ed. Proceedings of the Sixth International 
Conference on Intelligent Systems for Molecular Biology , pages 175-182, Menlo Park, 

25 CA, 1998. AAAI Press. The PSORT II program may also be used to predict cellular 

localizations. Horton ti^h. Intelligent Systems for Molecular Biology 5\ 147-152(1997). 
The table below includes the following sequence annotations: Signal peptide presence; 
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TM (number of membrane domain, topology in orientation and position); Amino acid 
location and antigenic index (location, AI score); PTM and other motifs (type, amino acid 
residue locations); and functional domains (type, amino acid residue locations). 
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Example lb: Sequence Alignment Support 

Alignments between previously identified sequences and splice variant sequences 
are performed to confirm unique portions of splice variant nucleic acid and amino acid 
sequences. The alignments are done using the Needle program in the European Molecular 
5 Biology Open Software Suite (EMBOSS) version 2.2.0 available at www.emboss.org 
from EMBnet (http://www.embnet.org). Default settings are used unless otherwise noted. 
The Needle program in EMBOSS implements the Needleman-Wunsch algorithm. 
Needleman, S. B., Wunsch, C. D., 1 Mol Biol 48:443-453 (1970). 

It is well know to those skilled in the art that implication of alignment algorithms 
10 by various programs may result in minor changes in the generated output. These changes 
include but are not limited to: alignment scores (percent identity, similarity, and gap), 
display of nonaligned flanking sequence regions, and number assignment to residues. 
These minor changes in the output of an alignment do not alter the physical characteristics 
of the sequences or the differences between the sequences, e.g. regions of homology, 
1 5 insertions, or deletions. 

Example Ic: RT-PCR Analysis 

To detect the presence and tissue distribution of a particular splice variant Reverse 
Transcription-Polymerase Chain Reaction (RT-PCR) is performed using cDNA generated 
from a panel of tissue RNAs. See, e,g., Sambrook et al^ Molecular Cloning: A Laboratory 

20 Manual 2d ed.. Cold Spring Harbor Laboratory Press (1989) and; Kawasaki ES et al, 
PNAS 85(15):5698 (1988). Total RNA is extracted from a variety of tissues and first 
strand cDNA is prepared with reverse transcriptase (RT). Each panel includes 23 cDNAs 
from five cancer types (lung, ovary, breast, colon, and prostate) and normal samples of 
testis, placenta and fetal brain. Each cancer set is composed of three cancer cDNAs from 

25 different donors and one normal pooled sample. Using a standard enzyme kit from BD 
Bioscience Clontech (Mountain View, CA), the target transcript is detected with 
sequence-specific primers designed to only amplify the particular splice variant. The PCR 
reaction is run on the GeneAmp PCR system 9700 (Applied Biosystem, Foster City, CA) 
thermocycler under optimal conditions. One of ordinaiy skill can design appropriate 

30 primers and determine optimal conditions. The amplified product is resolved on an 

agarose gel to detect a band of equivalent size to the predicted RT-PCR product. A band 



wo 2004/050860 



PCT/US2003/040063 



180 

indicated the presence of the splice variant in a sample. The relation of the amplified 
product to the splice variant was subsequently confirmed by DNA sequencing. 

After subcloning, all positively screened clones are sequence verified. The DNA 
sequence verification results show the splice variant contains the predicted sequence 
differences in comparison with the reference sequence. 

Results for RT-PCR analysis include the Sequence DEX ID, Lead Name, Cancer 
Tissue(s) the transcript was detected in. Normal Tissue(s) the transcript was detected in, 
the predicted length of the RT-PCR product, and the Confirmed Length of the RT-PCR 
product. 

RT-PCR results confimi the presence SEQ ID NO: 1-95 in biologic samples and 
distinguish between related transcripts. 

Example Id: Secretion Assay 

To determine if a protein encoded by a splice variant is secreted from cells a 
secretion assay is preformed. A pcDNA3.1 clone containing the gene transcript which 
encodes the variant protein is transfected into 293T cells using the Superfect transfection 
reagent (Qiagen, Valencia CA). Transfected cells are incubated for 28 hours before the 
media is collected and immediately spun down to remove any detached cells. The 
adherent cells are solubilized with lysis buffer (1% NP40, lOmM sodium phosphate 
pH7.0, and 0.1 5M NaCl). The lysed cells are collected and spun down and the 
supernatant extracted as cell lysate. Western immunoblot is carried out in the following 
manner: 15p.l of the cell lysate and media are run on 4-12% NuPage Bis-Tris gel 
(Invitrogen, Carlsbad CA), and blotted onto a PVDF membrane (Invitrogen, Carlsbad 
CA). The blot is incubated wift a polyclonal primary antibody which binds to the variant 
protein (Imgenex, San Diego CA) and polyclonal goat anti-rabbit-peroxidase secondary 
antibody (Sigma-Aldrich, St. Louis MO). The blot is developed with the ECL Plus 
chemiluminescent detection reagent (Amersham Biosciences, Piscataway NJ). 

Secretion assay results are indicative of SEQ ID NO: 96-237 being a diagnostic 
marker and/or therapeutic target for cancer. 

Example 2a: Gene Expression Analysis 

Custom Microarray Experiment - Cancer 

Custom oligonucleotide microarrays were provided by Agilent Technologies, Inc. 
(Palo Alto, CA). The microarrays were fabricated by Agilent using their technology for 
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the in-situ synthesis of 60iner oligonucleotides (Hughes, et al. 2001, Nature Biotechnology 
19:342-347). The 60meT microarray probes were designed by Agilent, from gene 
sequences provided by diaDexus, using Agilent proprietary algorithms. Whenever 
possible two different 60mers were designed for each gene of interest. 
5 All microarray experiments were two-color experiments and were preformed using 

Agilent-recommended protocols and reagents. Briefly, each microarray was hybridized 
with cRNAs synthesized from UNA (total RNA for ovarian and prostate, polyA+ RNA for 
lung, breast and colon samples), isolated from cancer and normal tissues, labeled with 
fluorescent dyes Cyanine3 (Cy3) or CyanineS (Cy5) (MEN Life Science Products, Inc., 

10 Boston, MA) using a linear amplification method (Agilent), In each experiment the 
experimental sample was RNA isolated from cancer tissue from a single individual and 
the reference sample was a pool of RNA isolated from normal tissues of the same organ as 
the cancerous tissue {i.e. normal ovarian tissue in experiments with ovarian cancer 
samples). Hybridizations were carried out at 60°C, overnight using Agilent in-situ 

15 hybridization buffer. Following washing, arrays were scanned with a GenePix 4000B 

Microarray Scanner (Axon Instruments, Inc., Union City, CA). The resulting images were 
analyzed with GenePix Pro 3.0 Microarray Acquisition and Analysis Software (Axon). 

Data normalization and expression profiling were done with Expressionist 
software from GeneData Inc. (Daly City, CA/Basel, Switzerland). Gene expression 

20 analysis was performed using only experiments that met certain quality criteria. The 

quality criteria that experiments must meet are a combination of evaluations performed by 
the Expressionist software and evaluations perfoimed manually using raw and normalized 
data. To evaluate raw data quality, detection limits (the mean signal for a replicated 
negative control + 2 Standard Deviations (SD)) for each channel were calculated. The 

25 detection limit is a measure of non-specific hybridization. Acceptable detection limits 
were defined for each dye (<80 for Cy5 and <150 for Cy3). Arrays with poor detection 
limits in one or both channels were not analyzed and the experiments were repeated. To 
evaluate normalized data quahty, positive control elements included in the array were 
utilized. These array features should have a mean ratio of 1 (no differential expression). 

30 If these features have a mean ratio of greater than 1 .5-fold up or down, the experiments 
were not analyzed further and were repeated. In addition to traditional scatter plots 
demonstrating the distribution of signal in each experiment, the Expressionist software 
also has minimum thresholding criteria that employ user defined parameters to identify 
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• quality data. These thresholds include tu'o distinct quality measurements: 1) minimum 
area percentage, which is a measure of the integrity of each spot and 2) signal to noise 
ratio, which ensures that the signal being measured is significantly above any background 
(nonspecific) signal present. Only those features that met the threshold criteria were 
5 included in the filtering and analyses carried out by Expressionist. The thresholding 

settings employed require a minimum area percentage of 60% [(% pixels > background + 
2SD)-(% pixels saturated)], and a minimum signal to noise ratio of 2.0 in both channels. 
By these criteria, very low expressors, saturated features and spots with abnormally high 
local background were not included in analysis. 

10 Relative expression data was collected from Expressionist based on filtering and 

clustering analyses. Up-regulated genes were identified using criteria for the percentage 
of experiments in which the gene is up-regulated by at least 2-fold. In general, up- 
regulation in -30% of samples tested was used as a cutoff for filtering. 

Two microarray experiments were preformed for each normal and cancer tissue 

15 pair. The tissue specific Array Chip for each cancer tissue is a unique microarray specific 
to that tissue and cancer. The Multi-Cancer Array Chip is a universal microarray that was 
hybridized with samples from each of the cancers (ovarian, breast, colon, lung, and 
prostate). See the description below for the experiments specific to the different cancers. 

Microarray Experiments and Data Tables 

20 COLON CANCER CHIPS 

For colon cancer two different chip designs were evaluated with overlapping sets 
of a total of 38 samples, comparing the expression patterns of colon cancer derived 
polyA+ RNA to polyA+ RNA isolated from a pool of 7 normal colon tissues. For the 
Colon Array Chip all 38 samples (23 Ascending colon carcinomas and 15 Rectosigmoidal 

25 carcinomas including: 5 stage I cancers, 15 stage H cancers, 15 stage in and 2 stage IV 
cancers, as well as 28 Grade 1/2 and 10 Grade 3 cancers) were analyzed. The 
histopathologic grades for cancer are classified as follows: GX, cannot be assessed; Gl, 
well differentiated; G2, Moderately differentiated; G3, poorly differentiated; and G4, 
undifferentiated. AJCC Cancer Staging Handbook. 5^^ Edition, 1998, page 9. For the 

30 Colon Array Chip analysis, samples were further divided into groups based on the 

expression pattern of the known colon cancer associated gene Thymidilate Synthase (TS) 
(13 TS up 25 TS not up). The association of TS with advanced colorectal cancer is well 
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documented. Paradise et al, Br J Cancer 82(3):560-7 (2000); Etienne et aLJClin 
Oncol 20(12):2832-43 (2002); Aschele et al Clin Cancer Res, 6(12):4797-802 (2000). 
For the Multi-Cancer Array Chip a subset of 27 of these samples (14 Ascending colon 
carcinomas and 13 Rectosigmoidal carcinomas including: 3 stage I cancers, 9 stage n 

5 cancers, 13 stage III and 2 stage IV cancers) were assessed. 

The results for the statistically significant up-regulated genes on the Colon Array 
Chip are shown in Table 1 and 2. The results for the statistically significant up-regulated 
genes on the Multi-Cancer Array Chip are shown in Table 3. 

The first two columns of each table contain information about the sequence itself 

10 (Seq ID, Oligo Name), the next columns show the results obtained for all ("ALL") the 
colon samples, ascending colon carcinomas ("ASC"), Rectosigmoidal carcinomas ("RS"), 
cancers corresponding to stages I and II ("STU2"), stages III and IV («ST3,4"), grades 1 
and 2 ("GR1,2"), grade 3 ("GR3"), cancers exhibiting up-regulation of the TS gene 
CTSup") or those not exhibiting up-regulation of the TS gene ("NOT TSup"). *%up' 

1 5 indicates the percentage of all experiments in which up-regulation of at least 2-fold was 
observed n=38 for the Colon Array Chip (n=27 for the Multi-Cancer Array Chip), '%valid 
up' indicates the percentage of experiments with valid expression values in which up- 
regulation of at least 2-fold was observed. 



Table 1. 



DEX ID 


Oligo 
Name 


Cln 
ALL 
%up 
n«38 


Cln 
ALL % 
valid 
up 
n=3B 


Cln 
ASC 
%up 
n=23 


Cln 
ASC % 
valid 
up 
n=23 


Cln 
RS 
%up 
n=15 


Cln 
RS % 
valid 
up 
n=15 


Cln 
ST1,2 
%up 
n=20 


Cln 

ST1,2 

% 

valid 
up 

n=:20 


Cln 
ST3,4 
%up 
n=18 


Cln 
ST3,4 

% 

valid 

up 

n=18 


DEX0448 OOl.nt.l 


34940.0 


23.7 


23.7 


30.4 


30.4 


13.3 


13.3 


25.0 


25.0 


22.2 


22.2 


DEX0448 002.nt.l 


39957.0 


52.6 


52.6 


56.5 


56.5 


46.7 


46.7 


70.0 


70.0 


33.3 


33.3 


DEX0448 002.nt.l 


39958.0 


44.7 


44 .7 


43 .5 


43.5 


46.7 


46.7 


60.0 


60.0 


27.8 


27.8 


DEX0448 OOS.nt.l 


32057.0 


18.4 


18.4 


26.1 


26.1 


6.7 


6.7 


15.0 


15.0 


22.2 


22.2 


DEX0448 004.nt.l 


41210.0 


21.1 


21.1 


30.4 


30.4 


6.7 


6.7 


15.0 


15.0 


27.8 


27.8 


DEX0448 005. nt. 2 


36243.0 


34.2 


34.2 


43.5 


43.5 


20.0 


20,0 


40.0 


40.0 


27.8 


27.8 


DEX0448 006.nt.l 


8410.0 


13.2 


13.2 


13.0 


13.0 


13.3 


13.3 


15.0 


15.0 


11.1 


11.1 


DEX0448 007.nt-l 


26449.0 


10.5 


10.8 


13 .0 


13.0 


6.7 


7.1 


5.0 


5.0 


16.7 


17,6 


DEX0448 OOS.nt.l 


32851.0 


28.9 


28.9 


30.4 


30.4 


26.7 


26.7 


35.0 


35.0 


22.2 


22.2 


DEX0448 009.nt.l 


34680. 0 


55.3 


61.8 


56.5 


59.1 


53.3 


66.7 


45.0 


56.2 


66.7 66.7 


DEX0448 009.nt.l 


40809.0 


57.9 


57.9 


56.5 


56.5 


60.0 


60.0 


55.0 


55.0 


61.1 161.1 


DEX0448 009.nt.l 


40810.0 


57.9 


57.9 


56.5 


56.5 


60.0 


60.0 


55.0 


55.0 


61.1 |61.1 


DEX0448 OlO.nt.l 


28423.0 


18.4 


18,9 


30.4 


30.4 


0.0 


0.0 


20.0 


20.0 


16.7 17.6 


DEX0448 Oll.nt.l 


8312.0 


2.6 


2.8 


4.3 


4.3 


0.0 


0.0 


0.0 


0.0 


5.6 5.9 


DEX0448 013.nt.l 


16006.0 


18.4 


18.4 


26 .1 


26 .1 


6.7 


6.7 


5.0 


5,0 


33.3 33.3 


DEX0448 014.nt.l 


37376.0 


23.7 


23.7 


13.0 


13.0 


40.0 


40.0 


20.0 


20.0 


27.8 27.8 


DEX0448 014.nt.l 


37378.0 


21.1 


21.1 


13.0 


13.0 


33 .3 


33.3 


20.0 


20.0 


22.2 22.2 


DEX044 8 OlS.nt.l 


33348.0 


34.2 


34.2 


34.8 


34.8 


33.3 


33.3 


30.0 


30.0 


38.9 38.9 


DEX044 8 OlS.nt.l 


38996.0 


28.9 


29.7 


26.1 


27.3 


33.3 


33.3 


25.0 


25.0 


33.3 135.3 
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DEX0448 017.nt.l 


36179. C 


15.8 


16.7 


8.7 


9.5 


26.7 


26.7 


15.0 


16.7 


16.7 


16.7 


DEX0448 017.nt.l 


36180. 0 


10.5 


11 .1 


8.7 


9.5 


13.3 


13.3 


15.0 


16.7 


5.6 


5.6 


DEX0448 017.nt.l 


36181.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 017.nt.l 


36182.0 


2.6 


3.2 


4.3 


5.6 


0.0 


0.0 


0.0 


0.0 


5.6 


6.2 


DEX0448 0i7.nt.l 


36184.0 


36.8 


36.8 


43.5 


43.5 


26.7 


26.7 


45.0 


45.0 


27.8 


27 .8 


DEX0448 017,nt.l 


37227.0 


5.3 


6.5 


4.3 


5.9 


6.7 


7.1 


5.0 


6.2 


5.6 


6.7 


DEX0448 017,nt.l 


37228.0 


2.6 


3.1 


4.3 


5.6 


0.0 


0.0 


0.0 


0.0 


5.6 


6.2 


DEX0448 017.nt.l 


37239.0 


28.9 


28.9 


34.8 


34 .8 


20.0 


20.0 


30.0 


30.0 


27.8 


27.8 


DEX0448 017.nt.l 


37240.0 


34 .2 


35.1 


39.1 


39.1 


26.7 


28.6 


35.0 


35.0 


33.3 


35.3 


DEX0448 017.nt.l 


37895.0 


5.3 


10.5 


0.0 


0.0 


13.3 


33.3 


0.0 


0.0 


11.1 


20.0 


DEX0448 OlB.nt.l 


41823 .0 


23.7 


23.7 


26.1 


26.1 


20.0 


20.0 


25.0 


25.0 


22.2 


22 .2 


DEX044 8 OlS.nt.l 


41824.0 


15.8 


15.8 


21.7 


21.7 


6.7 


6.7 


15.0 


15.0 


16.7 


16.7 


DEX0448 020.nt.l 


39655.0 


44.7 


44.7 


56.5 


56.5 


26.7 


26.7 


40.0 


40.0 


50.0 


50.0 


DEX0448 020.nt.l 


39656.0 


26.3 


26.3 


30.4 


30.4 


20.0 


20.0 


25.0 


25.0 


27.8 


27.8 


DEX0448 02l.nt.l 


30819.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 021.nt.l 


30870.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 021.nt.l 


30931.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 021.nt.l 


31146.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 02l.nt.l 


31154 .0 


7.9 


7.9 


8.7 


8.7 


6.7 


6.7 


0.0 


0.0 


16.7 


16.7 


DEX0448 021.nt.l 


31155.0 


7.9 


8.6 


8.7 


10.0 


6.7 


6.7 


5.0 


5.6 


11.1 


11.8 


DEX0448 02l,nt.l 


31157.0 


2.6 


2.6 


0.0 


0.0 


6.7 


6.7 


0.0 


0.0 


5.6 


5.6 


DEX0448 021,nt.l 


35218.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0,0 


0.0 


DEX0448 021. nt. 2 


30819.0 


0.0 


0,0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 021. nt. 2 


30870.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 021. nt. 2 


30931.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 021. nt. 2 


31146 .0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448_021.nt .2 


31154 .0 


7.9 


7.9 


8.7 


8.7 


6.7 


6.7 


0.0 


0.0 


16.7 


16.7 


DEX0448 021. nt. 2 


31155.0 


7.9 


8.6 


8.7 


10.0 


6.7 


6.7 


5.0 


5.6 


11.1 


11.8 


DEX0448 021. nt. 2 


31157.0 


2.6 


2.6 


0.0 


0.0 


6.7 


6.7 


0.0 


0.0 


5.6 


5.6 


DEX0448 021. nt. 2 


35218.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX044 8 021.nt.3 


30819.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 021. nt. 3 


30870.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 021. nt. 3 


31154 .0 


7.9 


7.9 


8.7 


8.7 


6.7 


6.7 


0.0 


0.0 


16.7 


16.7 


DEX0448 021.nt.3 


31157.0 


2 . 6 


2.6 


0.0 


0.0 


6.7 


6.7 


0.0 


0.0 


5.6 


5.6 


DEX0448 021. nt. 3 


35218 .0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 022.nt.l 


36878 .0 


36.8 


37.8 


47.8 


47.8 


20.0 


21.4 


25.0 


26.3 


50.0 


50.0 


DEX0448 022.nt.l 


37415.0 


52.6 


52.6 


69.6 


69.6 


26.7 


26.7 


50.0 


50.0 


55.6 


55.6 


DEX0448 022. nt. 2 


36878.0 


36.8 


37.8 


47.8 


47.8 


20.0 


21.4 


25.0 


26.3 


50.0 


50.0 


DEX0448 022. nt. 2 


37415 .0 


52.6 


52.6 


69.6 


69.6 


26.7 


26.7 


50.0 


50.0 


55.6 


55.6 


DEX0448 023.nt.l 


22297.0 


21.1 


21.1 


30.4 


30.4 


6.7 


6.7 


20.0 


20.0 


22.2 


22.2 


DEX0448 024.nt.l 


19607.0 


36.8 


36.8 


43.5 


43.5 


26.7 
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34.8 


26.7 


26.7 


40.0 


40,0 


22.2 


22.2 


DEX0448 040.nt.4 


31346.0 


36.8 


36.8 


39.1 


39.1 


33.3 


33.3 


40.0 


40.0 


33.3 


33.3 


DEX0448 040. nt. 4 


31347.0 


31.6 


32.4 


34.8 


34.8 


26.7 


28.6 


40.0 


40.0 


22.2 


23.5 


DEX044 8 04 0.nt.4 


40755.0 


28.9 


28.9 


34.8 


34 .8 


20.0 


20.0 


35.0 


35.0 


22.2 


22.2 


DEX0448 040. nt. 4 


40756.0 


31.6 


31.6 


34.8 


34.8 


26.7 


26.7 


40.0 


40.0 


22.2 


22.2 


DEX044 8 04 0.nt.5 


31346.0 


36.8 


36.8 


39.1 


39.1 


33.3 


33.3 


40.0 


40.0 


33.3 


33.3 


DEX0448 040. nt. 5 


31347.0 


31.6 


32.4 


34.8 


34 .8 


26.7 


28.6 


40.0 


40.0 


22 .2 


23.5 


DEX044B 040. nt. 5 


40755.0 


28.9 


28 . 9 


34.8 


34.8 


20.0 


20.0 


35.0 


35 . 0 


22.2 


22 .2 


DEX044 8 04 0.nt,5 


40756.0 


31.6 


31.6 


34,8 


34.8 


26.7 


26.7 


40.0 


40.0 


22.2 


22.2 


DEX0448 040. nt. 6 


31346.0 


36.8 


36.8 


39.1 


39.1 


33 .3 


33 .3 


40.0 


40.0 


33.3 


33.3 


DEX0448 040. nt. 6 


31347.0 


31.6 


32 .4 


34 .8 


34.8 


26.7 


28.6 


40.0 


40.0 


22.2 


23.5 


DEX0448 040.nt.€ 


35943.0 


0.0 


0.0 


0.0 


0.0 


0.0 


^.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 04 0.nt.6 


35944.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
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DEX0448 040. nt. 6 


40755 . 0 


28.9 


28.9 


'i A Q 




on n 


9 n n 
z u . u 




35.0 


22 . 2 


22 . 2 


DEX0448 040. nt. 6 


40756 . 0 


31.6 


31.6 


34.8 


34.8 


2b . / 


O C "7 
. / 


^ u . u 




22.2 


22 . 2 


DEX0448 040. nt. 7 


31346 . 0 


36 . 8 


36 . 8 


3 9.1 


39.1 


3 J . 3 




4 u . u 






33 . 3 


DEX044 8 04 0.nt.7 


31347 . 0 


31 . 6 


32.4 


j4 . o 


J 4 . o 


2o . / 


Z 0 . D 


A r\ n 
fi U . u 


4 0.0 


22.2 


23 . 5 




4 0755 . 0 


28.9 


28 . 9 


34 . 8 


34 . 8 


20.0 


20.0 


35.0 


35.0 


22.2 


22.2 


DEX0448 040. nt. 7 


40756.0 


31.6 


31.6 


34.8 


34.8 


26.7 


26.7 


40.0 


40.0 


22.2 


22.2 


DEX044 8 041.nt.l 


32216.0 


52.6 


52.6 


52.2 


52.2 


53.3 


53.3 


55,0 


55.0 


50.0 


50.0 


DEX0448 042.nt.l 


29231.0 


15.8 


16.2 


13.0 


13.0 


20.0 


21.4 


20.0 


21.1 


11.1 


11.1 


DEX0448 042.nt.l 


29232.0 


15.8 


15.8 


13.0 


13.0 


20.0 


20.0 


20.0 


20.0 


IX. 1 


11.1 


DEX0448 042.nt.l 


29271.0 


23.7 


23.7 


21.7 


21.7 


26.7 


26.7 


30,0 


30.0 


16.7 


16.7 


DEX0448 042.nt.l 


29272.0 


26.3 


26.3 


21.7 


21.7 


33.3 


33.3 


30.0 


30.0 


22.2 


22.2 


DEX0448 042.nt.l 


29290.0 


31-6 


31.6 


30.4 


30.4 


33.3 


33.3 


45.0 


45.0 


16.7 


16.7 


DEX0448 043.nt.l 


9043.0 


18.4 


18.4 


21.7 


21.7 


13.3 


13.3 


15.0 


15.0 


22.2 


22.2 


DEX0448 044.nt.l 


36803.0 


28.9 


28.9 


17.4 


17.4 


46.7 


46.7 


25.0 


25.0 


33.3 


33.3 



Table 2. 





Dligo 


Zln 
3R1,2 
%up 
a=28 


^1 n 

^in 

%va,lid 
n=28 


1 

Zln 
3R3 
%up 
n=10 


Zln 
3R3 

%valid 
up 

n=10 


Zln 

rs 

ap 

%up 

n=13 


::in TS , 
Ivalid 

Lip 

ri=13 


:in i 
^OT ] 

rs 1 

ap 

%up 

n=25 


Zln 

SFOT TS 
ap 

%valid 
up 

n=25 


DEX0448 OOl.nt.l 


34940.0 


17.9 


17.9 


40.0 


40.0 


46.2 


46.2 


12 . 0 


12 . 0 


DEX0448 002.nt.l 


39957.0 


50.0 


50. 0 


60.0 


60.0 


69.2 


69 .2 


44 . 0 


44 . 0 


DEX0448 002.nt.l 


39958.0 


42 .9 


42 .9 


50.0 


50.0 


61.5 


61.5 


36 . 0 


36 . 0 


DEX0448 003.nt.l 


32057.0 


7.1 


7.1 


50.0 


50.0 


38.5 


38.5 


8 . 0 


8 . 0 


DEX0448 004.nt.l 


41210,0 


10.7 


10.7 


50.0 


50.0 


38.5 


38.5 


12 . 0 


12 . 0 


DEX0448 005. nt. 2 


36243.0 


35.7 


35,7 


30.0 


30.0 


46.2 


46.2 


28 . 0 


28.0 


DEX0448 006.nt.l 


8410.0 


14.3 


14.3 


10.0 


10.0 


30.8 


30.8 


4 . 0 


4 . 0 


DEX0448 007.nt.l 


26449 . 0 


"3 C 
J . 0 


^ . / 




30.0 


15 . 4 


15 . 4 


8 . 0 


8.3 


DEX0448 OOa.nt.l 


32851.0 


28.6 


28.6 


30.0 


30.0 


46.2 


46.2 


20.0 


20.0 


DEX044 8 009.nt.l 


34680.0 


57.1 


66.7 


50.0 


50.0 


38.5 


41.7 


64.0 


72.7 


DEX0448 009.nt.l 


40809.0 


64 .3 


64.3 


40.0 


40.0 


30.8 


30.8 


72.0 


72. 0 


DEX0448 009.nt.l 


40810.0 


64.3 


64.3 


40.0 


40.0 


30 .8 


30.8 


72.0 


72.0 


DEX0448 OlO.nt.l 


28423.0 


10.7 


11.1 


40.0 


40.0 


23.1 


23.1 


16.0 


16.7 


DEX0448 Oll.nt.l 


8312.0 


3.6 


3.7 


0.0 


0.0 


0.0 


0.0 


4.0 


4.3 


DEX0448 013.nt.l 


16006.0 


10.7 


10.7 


40.0 


40.0 


38.5 


38.5 


8.0 


8.0 


DEX0448 014.nt.l 


37376.0 


25.0 


25.0 


20.0 


20.0 


7.7 


7,7 


32.0 


32.0 


DEX0448 014.nt.l 


37378.0 


21.4 


21.4 


20.0 


20.0 


7.7 


7.7 


28.0 


28-0 


DEX0448 015.nt.l 


33348.0 


28.6 


28.6 


50.0 


50.0 


38.5 


38.5 


32.0 


32.0 


DEX0448 OlS.nt.l 


38996.0 


32.1 


32.1 


20.0 


22.2 


15.4 


15.4 


36 .0 


37.5 


DEX0448 017.nt.l 


36179.0 


17.9 


18.5 


10.0 


11.1 


7.7 


8.3 


20.0 


20.8 


DEX0448 017.nt.l 


36180.0 


10.7 


11.1 


10.0 


11.1 


7.7 


8,3 


12.0 


12.5 


DEX0448 017.nt.l 


36181.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 017.nt.l 


36182.0 


3.6 


4.2 


0.0 


0.0 


7.7 


10.0 


0.0 


0.0 


DEX0448 017.nt.l 


36184.0 


39.3 


39,3 


30.0 


30.0 


46.2 


46.2 


32.0 


32.0 


DEX0448 017.nt.l 


37227.0 


7.1 


8.3 


0.0 


0.0 


7.7 


11.1 


4.0 


4.5 


DEX044B 017.nt.l 


37228.0 


3.6 


4.0 


0.0 


0.0 


7.7 


10.0 


0,0 


0.0 


DEX0448 017.nt.l 


37239.0 


28.6 


28-6 


30.0 


30.0 


38.5 


38.5 


24.0 


24.0 


DEX0448 017.nt.l 


37240.0 


32.1 


33.3 


40.0 


40.0 


46 .2 


46.2 


28.0 


29.2 


DEX0448 017.nt.l 


37895.0 


7.1 


16.7 


0.0 


0.0 


0.0 


0.0 . 


8.0 


15.4 


DEX0448 OlS.nt.l 


41823,0 


25.0 


25.0 


20.0 


20.0 


30.8 


30.8 


20.0 


20.0 


DEX0448 OlS.nt.l 


41624.0 


14.3 


14.3 


20.0 


20.0 


23.1 


23.1 


12.0 


12.0 


DEX0448 020.nt.l 


39655. 0 


42.9 


42.9 


50.0 


50.0 


61.5 


61.5 


36.0 


36.0 


DEX0448-020.nt.l 


39656.0 


21.4 


21.4 


40-0 


40.0 


30.8 


30.8- 


24.0 


24.0 


DEX0448 021.nt.l 


30819.0 


jo.o 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
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DEa0448 021.nt.l 


30870 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


DEX0448 021.nt.l 


30931 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


DEX0448 021.nt.l 


31146 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


DEX0448 021.nt.l 


31154 . 0 


0 . 0 


0 . 0 


30 . 0 


30 . 0 


15 . 4 


15 . 4 


4 . 0 


4 . 0 


DEX04 4 8 021.nt.l 


31155 . 0 


0 . 0 


0 . 0 


30.0 


33 .3 


23 . 1 


23 .1 


0.0 


0.0 


DEX0448 021.nt.l 


31157.0 


0 . 0 


0 . 0 


10.0 


10.0 


7.7 


7.7 


0.0 


0.0 


DEX0448 02l.nt.l 


35218.0 


0.0 


0 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 021. nt. 2 


30819.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 021. nt. 2 


30870.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 021. nt. 2 


30931.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 021. nt. 2 


31146.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX044 8 021. nt. 2 


31154.0 


0.0 


0.0 


30.0 


30.0 


15.4 


15.4 


4.0 


4 , 0 


DEX0448 021. nt. 2 


31155.0 


0.0 


0.0 


30.0 


33 .3 


23 . 1 


23.1 


0.0 


0.0 


DEX0448 021. nt. 2 


31157 . 0 


0.0 


0.0 


10. 0 


10.0 


7.7 


7.7 


0.0 


0.0 


DEX044 8 021.nt.2 


35218 . 0 


0 . 0 


0 . 0 


0. 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 021. nt. 3 


30819 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 021. nt. 3 


30870.0 


0 . 0 


0 . 0 


0.0 


0. 0 


0.0 


0 . 0 


0.0 


0 . 0 


DEX0448 021. nt. 3 


31154 . 0 


0.0 


0.0 


30 . 0 


30.0 


15 . 4 


15.4 


4 . 0 


4 . 0 


DEX0448 021. nt. 3 


31157.0 


0.0 


0 . 0 


10 . 0 


10.0 


7.7 


7 . 7 


0.0 


0.0 


DEX0448 021. nt. 3 


35218 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0.0 


0.0 


DEX0448 022.nt.l 


36878 .0 


39.3 


40 . 7 


30.0 


30.0 


38.5 


38 . 5 


36 . 0 


37.5 


DEX0448 022.nt.l 


37415 . 0 


46.4 


46.4 


70 . 0 


70 . 0 


46.2 


46.2 


56 . 0 


56 . 0 


DEX0448 022. nt. 2 


36878 . 0 


39.3 


40 . 7 


30 .0 


30 . 0 


38.5 


38 .5 


36 . 0 


37.5 


DEX0448 022. nt. 2 


37415 . 0 


46 .4 


46 . 4 


70 . 0 


70 . 0 


46 . 2 


46 . 2 


56 . 0 


56 . 0 


DEX0448 023.nt,l 


22297 . 0 


17 . 9 


17 . 9 


30 . 0 


30 . 0 


38.5 


38 . 5 


12 . 0 


12 . 0 


DEX0448 024.nt.l 


19607 . 0 


32 . 1 


32 . 1 


50 . 0 


50 . 0 


46,2 


46 . 2 


32 . 0 


32 . 0 


DEX044 8 025.nt.l 


40033 . 0 


85 . 7 


85 , 7 


70 . 0 


70 . 0 


76 . 9 


76 . 9 


84 . 0 


84 . 0 


DEX0448 025 .nt . 1 


40034 . 0 


82 . 1 


82 . 1 


70 . 0 


70 . 0 


76 . 9 


76 . 9 


80 . 0 


80.0 


DEX0448 026.nt.l 


41210 . 0 


10 . 7 


10 . 7 


50. 0 


50 . 0 


38.5 


38.5 


12 . 0 


12 . 0 


DEX0448 026'nt.l 


41284 . 0 


7 . 1 


7 . 1 


0.0 


0 . 0 


0 . 0 


0 . 0 


8 . 0 


8 . 0 


DEX0448 026. nt. 2 


41210 . 0 


10 . 7 


10 . 7 


50. 0 


50 . 0 


38.5 


38.5 


12 . 0 


12 . 0 


DEX0448 026. nt. 2 


41284 . 0 


7.1 


7.1 


0.0 


0.0 


0 . 0 


0 . 0 


8 . 0 


8 . 0 


DEX0448 027.nt.l 


9622. 0 


14 .3 


14 .3 


20.0 


20.0 


30.8 


30.8 


8 . 0 


8 . 0 


DEX0448 027.nt.l 


9623 .0 


14 .3 


14 .3 


20.0 


20. 0 


30.8 


30.8 


8 . 0 


8 . 0 


DEX0448 027. nt. 2 


9622 .0 


14 .3 


14 .3 


20.0 


20.0 


30.8 


30.8 


8.0 


8.0 


DEX0448 027. nt. 2 


9623 . 0 


14 .3 


14 .3 


20 . 0 


20.0 


30.8 


30 . 8 


8.0 


8 . 0 


DEX0448 027. nt. 3 


9622 . 0 


14 .3 


14 .3 


20.0 


20.0 


30.8 


30.8 


8.0 


8 . 0 


DEX0448 027. nt. 3 


9623 . 0 


14 .3 


14 .3 


20.0 


20.0 


30.8 


30 . 8 


8.0 


8.0 


DEX0448 027. nt. 4 


9622 .0 


14.3 


14.3 


20.0 


20.0 


30.8 


30.8 


8.0 


8.0 


DEX0448 027. nt. 4 


9623 .0 


14.3 


14.3 


20.0 


20.0 


30.8 


30. 8 


8.0 


8.0 


DEX0448 027. nt. 5 


9622.0 


14 .3 


14 .3 


20.0 


20.0 


30.8 


30.8 


8.0 


8.0 


DEX0448 027. nt. 5 


9623.0 


14.3 


14.3 


20.0 


20.0 


30.8 


30.8 


8.0 


8.0 


DEX0448 027. nt. 6 


9622.0 


14 .3 


14 .3 


20.0 


20. 0 


30.8 


30.8 


8.0 


8.0 


DEX0448 027. nt. 6 


9623.0 


14 .3 


14.3 


20.0 


20.0 


30.8 


30.8 


8.0 


8.0 


DEX0448 029.nt.l 


28733 .0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 029.nt.l 


28734 . 0 


3.6 


3.6 


40.0 


40.0 


15.4 


15.4 


12,0 


12.0 


DEX0448 029.nt.l 


38381 . 0 


0.0 


0.0 


10.0 


11 . 1 


7 . 7 


7 . 7 


0.0 


0.0 


DEA0446 029.nt.l 


3 6362.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 029.nt,l 


38383.0 


0.0 


0.0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DEX0448 029.nt.l 


38384.0 


0.0 


0.0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DBX0448 029. nt. 2 


28733.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DKX044 8 029. nt. 2 


28734.0 


3.6 


3.6 


40.0 


40.0 


15.4 


15.4 


12.0 


12.0 


DEX0448 029. nt. 2 


38381.0 


0.0 


0.0 


10.0 


11.1 


7.7 


7.7 


0.0 


0.0 


DEX0448 029. nt. 2 


38382.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 029. nt. 2 


38383.0 


0.0 


0.0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DEX0448 -029. nt. 2 


38384.0 


0.0 


0.0 


10.0 


10.0 - 


0.0 


0.0 


4.0 


4.0 


DEX0448 029. nt. 3 


28733.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
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DEX0448 029. nt. 3 


28734 . 0 


3.6 


3.6 


40.0 


40.0 


15.4 


15.4 


12.0 


12 .0 


DEX0448 029. nt. 3 


38381 . 0 


0.0 


0.0 


10.0 


11.1 


7.7 


7.7 


0.0 


0.0 


DEX0448 029. nt. 3 


38382 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 02S.nt.3 


38383 . 0 


0.0 


0.0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DEX0448 029.nt.3 


36384 . 0 


0.0 


0.0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DEX0448 029. nt. 4 


28733 .0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX044 8 029. nt. 4 


28734 . 0 


3.6 


3.6 


40.0 


40.0 


15.4 


15.4 


12.0 


12.0 


DEX0448 029. nt. 4 


38381 . 0 


0.0 


0.0 


10.0 


11.1 


7.7 


7.7 


0.0 


0.0 


DEX0448 029. nt. 4 


38382 . 0 


0 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 029.nt.4 


38383 . 0 


0.0 


0 . 0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DEX0448 029. nt. 4 


38384 . 0 


0 . 0 


0 . 0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DEX0448 029. nt. 5 


28733 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 029. nt. 5 


28734 . 0 


3 . 6 


3 . 6 


40.0 


40.0 


15.4 


15.4 


12.0 


12.0 


DEX044 8 029. nt. 5 


38381 . 0 


0 . 0 


0.0 


10.0 


11.1 


7.7 


7.7 


0.0 


0.0 


DEX0448 029. nt. 5 


38382 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 029. nt. 5 


38383 . 0 


0.0 


0.0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DEX044 8 029. nt. 5 


38384.0 


0.0 


0.0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DEX0448 029. nt. 6 


28733 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 029. nt. 6 


28734 . 0 


3.6 


3.6 


40 . 0 


40.0 


15.4 


15 .4 


12.0 


12.0 


DEX0448 029. nt. 6 


38381.0 


0.0 


0.0 


10.0 


11.1 


7.7 


7.7 


0.0 


0.0 


DEX0448 029. nt. 6 


38382 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 029. nt. 6 


38383 . 0 


0.0 


0.0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DEX0448 029. nt. 6 


38384,0 


0.0 


0.0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DEX0448 029. nt. 7 


28733.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 029. nt. 7 


28734.0 


3.6 


3.6 


40.0 


40.0 


15.4 


15.4 


12.0 


12.0 


DEX044e 029. nt. 7 


38381.0 


0.0 


0.0 


10.0 


11.1 


7.7 


7.7 


0.0 


0.0 


DEX0448 029. nt. 7 


38382.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 029. nt. 7 


38383 . 0 


0.0 


0.0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DEX0448 029. nt. 7 


38384 . 0 


0.0 


0.0 


10.0 


10.0 


0.0 


0.0 


4.0 


4.0 


DEX0448 O30.nt.l 


29603 . 0 


3.6 


3.6 


10.0 


10.0 


0.0 


0.0 


8.0 


8.0 


DEX0448 030.nt.l 


29604 . 0 


7 . 1 


7.4 


10.0 


10.0 


0.0 


0.0 


12.0 


12.5 


DEX044 8 OSO.nt.l 


40867 . 0 


7.1 


8.0 


20.0 


20.0 


15.4 


16.7 


8.0 


8.7 


DEX0448 030.nt.l 


40868.0 


7.1 


7.1 


30.0 


30.0 


15.4 


15.4 


12.0 


12.0 


DEX0448 032.nt.l 


36179. 0 


17.9 


18.5 


10.0 


11.1 


7.7 


8.3 


20.0 


20.8 


DEX0448 032.nt.l 


36180.0 


10.7 


11.1 


10.0 


11.1 


7.7 


8.3 


12.0 


12.5 


DBX0448 032,nt.l 


36182.0 


3.6 


4.2 


0.0 


0.0 


7.7 


10.0 


0.0 


0.0 


DEX0448 032.nt.l 


36184.0 


39.3 


39.3 


30.0 


30.0 


46.2 


46.2 


32.0 


32.0 


DEX044 8 032.nt.l 


37227.0 


7.1 


8.3 


0.0 


0.0 


7.7 


11.1 


4.0 


4.5 


DEX0448 032.nt.l 


37228 . 0 


3 .6 


4.0 


0.0 


0.0 


7.7 


10.0 


0.0 


0.0 


DEX0448 032.nt.l 


37239.0 


28.6 


28.6 


30.0 


30.0 


38.5 


38.5 


24.0 


24.0 


DEX0448 032.nt.l 


37240. 0 


32.1 


33 .3 


40.0 


40.0 


46.2 


46.2 


28.0 


29.2 


DEX0448 035-nt.l 


23862,0 


14 .3 


14.3 


30.0 


30.0 


23.1 


23.1 


16.0 


16.0 


DEX0448 035.nt.l 


23863.0 


25 . 0 


25.0 


30.0 


30.0 


30.8 


30.8 


24.0 


24.0 


DEX0448 035. nt. 2 


23862 . 0 


14 .3 


14 .3 


30.0 


30.0 


23.1 


23.1 


16.0 


16.0 


DEX0448 035. nt. 2 


23863.0 


25.0 


25.0 


30.0 


30.0 


30.8 


30.8 


24.0 


24.0 


DEX044 8 035.nt.3 


23862. 0 


14.3 


14.3 


30.0 


30.0 


23 .1 


23.1 


16.0 


16.0 


DEX0448 035. nt. 3 


23863.0 


25.0 


25.0 


30.0 


30.0 


30.8 


30.8 


24.0 


24.0 


DEX0448 035. nt. 4 


23862.0 


14.3 


14.3 


30.0 


30.0 


23.1 


23.1 


16.0 


16.0 


DEX0448 035. nt. 4 


23863.0 


25.0 


25.0 


30.0 


30.0 


30.8 


30.8 


24.0 


24,0 


DEX0448 036.nt.l 


29532.0 


17.9 


17.9 


50.0 


50.0 


38.5 


38,5 


20.0 


20.0 


DEX0448 036.nt.l 


29533.0 


0.0 


0.0 


0.0 


0-0 


0.0 


0.0 


0.0 


0.0 


DEX0448 036.nt.l 


29534.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 036.nt.l 


29540.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 036.nt.l 


36811.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 036.nt.l 


36812.0 


0.0 


0.0 


10.0 


12.5 


7.7 


11.1 


0.0 


0.0 


DEX0448 036. nt. 2 


29532.0 


17.9 ■ 


17.9 


50.0 


50.0 


38.5 


38.5 


20.0 


20-.*D 


DEX044e 036. nt. 2 


29539.0 


0.0 


0.0 


10.0 


11.1 


7.7 


7.7 


0.0 


0.0 
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DEX0448_036.nt,2 


29540.0 


0.0 


0,0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 036. nt. 3 


29532 .0 


17.9 


17.9 


50.0 


50.0 


38.5 


38 .5 


20.0 


20.0 


DEX0448 036. nt. 3 


29539.0 


0.0 


0.0 


10.0 


11.1 


7.7 


7.7 


0.0 


0.0 


DEX0448 036. nt. 3 


29540.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 037.nt.l 


36341.0 


7.1 


7.1 


20.0 


20.0 


30.8 


30.8 


0.0 


0.0 


DEX044 8 037.nt.l 


36342.0 


3.6 


3.8 


20.0 


20.0 


23.1 


25 .0 


0.0 


0.0 


DEX0448 037. nt. 2 


36341.0 


7.1 


7.1 


20.0 


20.0 


30.8 


30 .8 


0.0 


0.0 


DEX0448_037.nt.2 


36342.0 


3.6 


3.8 


20.0 


20.0 


23 .1 


25.0 


0.0 


0.0 


DEX0448 037. nt. 3 


36341.0 


7.1 


7.1 


20.0 


20.0 


30.8 


30.8 


0.0 


0.0 


DEX0448 037. nt. 3 


36342.0 


3.6 


3.8 


20.0 


20.0 


23 .1 


25.0 


0.0 


0.0 


DEX0448_037.nt .4 


36341.0 


7.1 


7.1 


20.0 


20.0 


30.8 


30.8 


0,0 


0.0 


DEX0448 037. nt. 4 


36342 . 0 


3.6 


3.8 


20.0 


20.0 


23 . 1 


25. 0 


0.0 


0.0 


DEX0448 037. nt. 5 


36341.0 


7.1 


7 . 1 


20. 0 


20.0 


30.8 


30.8 


0.0 


0.0 


DEX0448 037.nt.5 


36342 .0 


3.6 


3.6 


20.0 


20.0 


23 .1 


25.0 


0.0 


0.0 


DEX0448 037. nt. 6 


36341.0 


7.1 


7.1 


20.0 


20.0 


30.8 


30.8 


0.0 


0.0 


DEX0448 037. nt. 6 


36342.0 


3.6 


3.8 


20.0 


20.0 


23.1 


25.0 


0.0 


0.0 


DEX0448 038.nt.l 


20695.0 


28.6 


28.6 


30.0 


33.3 


84.6 


91.7 


0.0 


0.0 


DEX044 8 038.nt.l 


20896.0 


28.6 


28.6 


40.0 


40.0 


92.3 


92.3 


0.0 


0.0 


DEX0448 039.nt.l 


38855.0 


28.6 


28.6 


50.0 


50.0 


23.1 


23 .1 


40.0 


40.0 


DEX0448_040.nt.l 


31347.0 


32.1 


33 .3 


30.0 


30.0 


53 .8 


53.8 


20.0 


20.8 


DEX044 8 04 0.nt.l 


40755.0 


28.6 


28.6 


30.0 


30.0 


53.8 


53.8 


16.0 


16.0 


DEX0448_040.nt .1 


40756.0 


32.1 


32.1 


30.0 


30.0 


53.8 


53 .8 


20.0 


20.0 


DEX0448 040. nt. 2 


31346.0 


35.7 


35.7 


40.0 


40.0 


61.5 


61.5 


24.0 


24.0 


DEX0448 040. nt. 2 


31347.0 


32.1 


33 .3 


30.0 


30.0 


53.8 


53.8 


20.0 


20.8 


DEX0448 040. nt. 2 


40755.0 


28.6 


28.6 


30.0 


30.0 


53.8 


53.8 


16.0 


16.0 


DEX0448 040. nt. 2 


40756.0 


32.1 


32.1 


30.0 


30.0 


53.8 


53.8 


20.0 


20.0 


DEX0448 040. nt. 3 


31346.0 


35.7 


35.7 


40.0 


40.0 


61.5 


61.5 


24. 0 


24.0 


DEX0448 040.nt.3 


31347.0 


32.1 


33.3 


30.0 


30.0 


53.8 


53.8 


20.0 


20.8 


DEX0448 040. nt. 3 


40755.0 


28.6 


28.6 


30.0 


30.0 


53.8 


53.8 


16.0 


16.0 


DEX0448 040. nt. 3 


40756.0 


32.1 


32.1 


30.0 


30.0 


53.8 


53.8 


20.0 


20.0 


DEX044 8 04 0.nt.4 


31346.0 


35.7 


35.7 


40,0 


40.0 


61.5 


61.5 


24.0 


24.0 


DEX0448 040. nt. 4 


31347.0 


32.1 


33.3 


30.0 


30.0 


53 .8 


53.8 


20.0 


20.8 


DEX044 8 04 0.nt.4 


40755.0 


28.6 


28.6 


30.0 


30.0 


53.8 


53.8 


16.0 


16.0 


DEX0448 040. nt. 4 


40756. 0 


32.1 


32.1 


30.0 


30.0 


53 .8 


53.8 


20.0 


20,0 


DEX0448 040. nt. 5 


31346.0 


35.7 


35 .7 


40.0 


40.0 


61.5 


61.5 


24.0 


24 .0 


DEX044 8 040.nt.5 


31347.0 


32.1 


33.3 


30.0 


30.0 


53.8 


53.8 


20.0 


20.8 


DEX0448 040. nt. 5 


40755.0 


28.6 


28.6 


30.0 


30. 0 


53 .8 


53.8 


16.0 


16.0 


DEX0448 04 0.nt.5 


40756.0 


32.1 


32.1 


30.0 


30.0 


53.8 


53.8 


20.0 


20.0 


DEX044 8 04 0.nt.6 


31346.0 


35.7 


35.7 


40.0 


40.0 


61.5 


61.5 . 


24.0 


24.0 


DEX0448 040. nt. 6 


31347.0 


32.1 


33.3 


30.0 


30.0 


53.8 


53.8 


20.0 


20.8 


DEX0448 040. nt. 6 


35943 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX044e 040. nt. 6 


35944 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 040. nt. 6 


40755.0 


28.6 


28.6 


30.0 


30.0 


53.8 


53.8 


16.0 


16.0 


DEX0448 040. nt. 6 


40756 .0 


32.1 


32.1 


30.0 


30.0 


53 .8 


53 .8 


20.0 


20.0 


DEX0448 040. nt. 7 


31346.0 


35.7 


35.7 


40.0 


40.0 


61.5 


61.5 


24.0 


24 .0 


DEX0448 040. nt. 7 


31347.0 


32.1 


33.3 


30.0 


30.0 


53.8 


53.8 


20.0 


20.8 


DEX0448 040. nt. 7 


40755.0 


28.6 


28.6 


30.0 


30.0 


53.8 


53.8 


16.0 


16.0 


DEX0448 040. nt. 7 


40756.0 


32.1 


32.1 


30.0 


30.0 


53.8 


53.8 


20.0 


20.0 


DEX0448 041.nt.l 


32216.0 


50.0 


50.0 


60.0 


60.0 


61.5 


61.5 


48.0 


48.0 


DEX044 8 042.nt.l 


29231.0 


17.9 


18.5 


10.0 


10.0 


23.1 


23.1 


12.0 


12.5 


DEX0448 042.nt.l 


29232.0 


17.9 


17.9 


10.0 


10.0 


15.4 


15.4 


16.0 


16.0 


DEX0448 042.nt.l 


29271.0 


21.4 


21.4 


30.0 


30.0 


30.8 


30.8 


20.0 


20.0 


DEX0448 042.nt.l 


29272.0 


21.4 


21.4 


40.0 


40.0 


30.8 


30.8 


24.0 


24.0 


DEX0448 042.nt.l 


29290. 0 


32.1 


32.1 


30.0 


30.0 


46.2 


46.2 


24.0 


24.0 


DEX0448 043.nt.l 


9043 . 0 


14.3 


14.3 


30.0 


30.0 


23.1 


23.1 


16.0 


16.0 


DEX044e 044.nt.l 


36803.-0 


32.1 


32.1 


20.0 


20.0 


0.0 


0.0 


44.0 


44.0 
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Table 3. 



DEX ID 


Oligo 
Mame 


Cln 
Multi- 
Cancer 
ALL %up 
n=27 


Cln 
Multi- 
Cancer 

TVT T 

"tfvaiiQ 

up ns^; / 


Cln 
Multi- 
Cancer 
ASC %up 
n=14 


Cln 
Multi- 
Cancer 

ASu 
•fi vaj. lo 
up n=14 


Cln 
Multi- 
Cancer 
RS %up 
n=13 


Cln 
Multi- 
Cancer 

RS 

? vai> iQ 

up 11 — XJ 




COCA A 




1 / . o 


u . u 


w . u 


. J. 


AO Q 


U C< A vrtlO UX?.I1U>X 






O C Q 








J O . 3 


DEX0448 026.nt.l 


78479 . 0 


18 . 5 


20.8 


35 . 7 


41.7 


0 . 0 


0 . 0 


DEX0448 026.nt.l 


78479.1 


14.8 


15.4 


28.6 


28 .6 


0,0 


0.0 


DEX0448 026. nt. 2 


78479.0 


18.5 


20,8 


35.7 


41.7 


0.0 


0.0 


DEX0448 026. nt, 2 


78479.1 


14.8 


15.4 


28.6 


28.6 


0.0 


0.0 


DEX0448 028.nt.l 


5305.0 


14.8 


14.8 


28.6 


28.6 


0.0 


0.0 


DEX0448 028.nL.l 


5306.0 


14.8 


14.8 


28.6 


28.6 


0.0 


0.0 


DEX0448 028. nt. 2 


5305.0 


14.8 


14.8 


28.6 


28.6 


0.0 


0.0 


DEX0448 028. nt. 2 


5306.0 


14.8 


14.8 


28.6 


28.6 


0.0 


0.0 


DEX0448 034.nt.l 


5354.0 


11.1 


17,6 


0.0 


0.0 


23 .1 


42.9 


DEX0448 044.nt.l 


42013.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0,0 


DEX04 4 8 044.nt.l 


42013 .1 


0.0 


0.0 


0.0 


0.0 


0.0 


0,0 


DEX0448 044.nt.l 


42013.2 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 



BREAST CANCER CHIPS 

For breast cancer two different chip designs were evaluated with overlapping sets 
5 of a total of 36 samples, comparing the expressjion patterns of breast cancer derived 
polyA+ RNA to polyA+ RNA isolated from a pool of 10 normal breast tissues. For the 
Breast Array Chip, all 36 samples (9 stage I cancers, 23 stage II cancers, 4 stage III 
cancers) were analyzed. These samples also represented 10 Grade 1/2 and 26 Grade 3 
cancers. The histopathologic grades for cancer are classified as follows: GX, cannot be 

10 assessed; Gl, well differentiated; G2, moderately differentiated; G3, poorly differentiated; 
and G4, undifferentiated. AJCC Cancer Staging Handbook, pp. 9, (5th Ed, 1998). 
Samples were fiirther grouped based on the expression patterns of the known breast cancer 
associated genes Her2 and ERa (10 HER2 up, 26 HER2 not up, 20 ER up and 16 ER not 
up) and for the Multi-Cancer Array Chip, a subset of 20 of fliese samples (9 stage I 

15 cancers, 8 stage 11 cancers, 3 stage III cancers) were assessed. 

The results for the statistically significant up-regulated genes on the Breast Array 
Chip are shown in Tables 4 and 5. The results for the statistically significant up- 
regulated genes on the Multi-Cancer Array Chip are shown in Table 6. The first two 
columns of each table contain information about the sequence itself (Seq ID, Oligo 

20 Name), the next columns show the results obtained for all ("ALL") breast cancer 

samples, cancers jcorresponding to stagel ("STl"), stages II and III ("ST2,3"). grades 1 
and 2 ("GR1,2"), grade 3 ("GR3"), cancers exhibiting up-regulation of Her2 ("HER2up") 
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or ERa ("ERup") or those not exhibiting up-regulation of Her2 ("NOT HER2up") or ERa 
("NOT ERup"). '%up' indicates the percentage of all experiments in which up- 
regulation of at least 2-fold was observed (n=36 for Colon Array Chip, n=20 for the 
Multi-Cancer Array Chip), '%valid up' indicates the percentage of experiments with 
5 valid expression values in which up-regulation of at least 2-fold was observed. 



Table 4. 



DEX ID 


Oligo 
Name 


Mam 
ALL 
%up 
n=36 


Mam 
ALL % 
valxd 
up 

ii — J o 


Mam 
STl 
%up 
n=9 


Mam 
STl 
% 

valid 

up 

n=9 


Mam 
ST2,3 
%up 
n=27 


Mam 

ST2,3 

% 

valid 

up 

n=27 


Mam 
GR1,2 
%up 
n=10 


Mam 
GR1,2 

% 

valid 

up 

n=10 


Mam 
GR3 
%up 
n=26 


Mam 
GR3 % 
valid 
up 
n=26 


DEX0448 OlO.nt.l 


17869.0 


8.3 


8.3 


11.1 


11.1 


7.4 


7.4 


0.0 


0.0 


11.5 


11.5 


DEX0448 OlO.nt.l 


31466.0 


8.3 


8.3 


0.0 


0.0 


11.1 


11.1 


0.0 


0.0 


11.5 


11,5 


DEX0448 Oll.nt.l 


20711.0 


16.7 


16.7 


0.0 


0.0 


22.2 


22.2 


0.0 


0.0 


23.1 


23.1 


DEX0448 Oie.nt.l 


22545. 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 OlO.nt.l 


22546 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 Olb.nt.l 


26543 . 0 


2.8 


2.8 


0.0 


0,0 


3.7 


3.7 


0.0 


0.0 


3.8 


3.8 


DEX0448 016, nt. 2 


22545.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 2 


22546.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 2 


26543.0 


2.8 


2.8 


0.0 


0.0 


3.7 


3.7 


0.0 


0.0 


3.8 


3.8 


DEX0448 016. nt. 3 


22545.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 3 


22546.0 


0.0 


0.0 


0,0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 3 


26543.0 


2.8 


2.8 


0.0 


0.0 


3.7 


3.7 


0.0 


0.0 


3.8 


3.8 


DEX0448 017.nt.l 


29052,0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 Oie.nt.l 


13245.0 


11.1 


11.1 


22.2 


22.2 


7.4 


7.4 


10.0 


10.0 


11.5 


11.5 


DEX0448 OlS.nt.l 


13299.0 


11.1 


11.1 


11.1 


11.1 


11.1 


11.1 


0.0 


0.0 


15.4 


15.4 


DEX0448 Oie.nt.l 


13320.0 


5.6 


5.6 


11,1 


11.1 


3.7 


3.7 


0.0 


0.0 


7.7 


7.7 


DEX0448 OlS.nt.l 


13321 . 0 


11.1 


11.1 


11.1 


11,1 


11.1 


11.1 


0.0 


0.0 


15.4 


15.4 


DEX0448 Oie.nt.l 


13323.0 


13.9 


13 .9 


11. 1 


11.1 


14.8 


14 .8 


0.0 


0.0 


19.2 


19.2 


DEX0448 Oie.nt.l 


13337.0 


11.1 


11.1 


22.2 


22.2 


7.4 


7,4 


10.0 


10.0 


11.5 


11.5 


DEX0448 OlS.nt.l 


13338.0 


5.6 


5.7 


11.1 


11.1 


3.7 


3.8 


0.0 


0.0 


7.7 


8.0 


DEX0448 022.nt.l 


32150.0 


36.1 


36.1 


55.6 


55.6 


29.6 


29.6 


20.0 


20.0 


42.3 


42.3 


DEX0448 022.nt.l 


32151.0 


22.2 


22.2 


44 .4 


44 .4 


14.8 


14.8 


10.0 


10.0 


26.9 


26.9 


DEX0448 022. nt. 2 


32150.0 


36.1 


36.1 


55.6 


55.6 


29.6 


29.6 


20.0 


20.0 


42.3 


42.3 


DEX0448 022. nt. 2 


32151.0 


22.2 


22. 2j 


44 .4 


44 .4 


14.8 


14.8 


10.0 


10.0 


26.9 


26.9 


DEX0448 027.nt.l 


15310.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027.nt.l 


15311.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027.nt.l 


15833.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027.nt.l 


15834.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027.nt.l 


15835.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027.nt.l 


15836.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027.nt.l 


15845.0 


2.8 


2.9 


0.0 


0.0 


3.7 


3.7 


0.0 


0.0 


3.8 


4.0 


DEX0448 027.nt.l 


15646.0 


2.8 


2.8 


0,0 


0.0 


3.7 


3.7 


0.0 


0.0 


3.8 


3.8 


DEX0448 027.nt.l 


32136.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027.nt.l 


32137.0 


2.8 


2.8 


0.0 


0.0 


3.7 


3.7 


0.0 


0.0 


3.8 


3.8 


DEX0448 027. nt. 2 


15310.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 2 


15311.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0,0 


0.0 


0.0 


DEX0448 027. nt. 2 


15833.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 2 


15834.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0,0 


0.0 


DEX0448 027.nt..2 


15835.0 


0.0 


0.0 


0..0 


0.0 


0.0 . 


0.0^ 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 2 


15836.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 2 


15845.0 


2.8 


2.9 


0.0 


0.0 


3.7 


3.7 


0.0 


0.0 


3.8 


4.0 



wo 2004/050860 



PCT/US2003/040063 



193 



DEX0448 027. nt. 2 


15846.0 


2 . 8 


2 . 8 


0.0 


0.0 


3.7 


3 . 7 


0 . 0 


0 . 0 


3 . 8 


3 . 8 


DEX0448 027. nt. 2 


32136.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0.0 


0 . 0 


0 . 0 


DEX0448 027.nt.2 


32137.0 


2.8 


2.8 


0.0 


0.0 


3.7 


3.7 


0.0 


0 . 0 


3 . 8 


3 . 8 


DEX0448 027. nt. 3 


15310.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 3 


15311. 0 


0.0 


0-0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 3 


15833 . 0 


0.0 


0 . 0 


0.0 


0.0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 3 


15834.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 3 


15835.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0 , 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 3 


15836.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0 , 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 3 


15845.0 


2 . 8 


2.9 


0.0 


0.0 


3.7 


3.7 


0 . 0 


0 . 0 


3 . 8 


4 . 0 


DEX0448 027. nt. 3 


15846.0 


2.8 


2.8 


0.0 


0.0 


3.7 


3.7 


0.0 


0 . 0 


3 . 8 


3 . 8 


DEX0448 027. nt. 3 


32136.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0 . 0 


0.0 


DEX0448 027. nt. 3 


32137.0 


2.8 


2.8 


0.0 


0.0 


3 .7 


3-7 


0.0 


0 . 0 


3 . 8 


3 . 8 


DEX0448 027. nt. 4 


15310.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 4 


15311.0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 4 


15833 .0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 , 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 4 


15834 . 0 


0.0 


0.0 


0.0 


0 . 0 


0.0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 4 


15835 . 0 


0 . 0 


0.0 


0 .0' 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 4 


15836 . 0 


0 . 0 


0.0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 4 


32136 .0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 4 


32137.0 


2 . 8 


2.8 


0.0 


0 . 0 


3 . 7 


3.7 


0 . 0 


0 . 0 


3 . 8 


3 . 8 


DEX0448 027. nt. 5 


15310 . 0 


0 . 0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027, nt. 5 


15311-0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 5 


15833 . 0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 5 


1S834 .0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 5 


15835.0 


0.0 


0.0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 5 


15836.0 


0 . 0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0. 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 5 


32136 .0 


0 . 0 


0.0 


0.0 


0 . 0 


0 . 0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 5 


32137 . 0 


2 . 8 


2.8 


0.0 


0 . 0 


3 . 7 


3 . 7 


0 . 0 


0 . 0 


3 . B 


3 . 8 


DEX0448 027. nt. 6 


15310 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 6 


15311 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0. 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 6 


15835 , 0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 6 


15836 .0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0 , 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 6 


15845 . 0 


2 . 8 


2.9 


0 . 0 


0.0 


3 . 7 


3 . 7 


0 . 0 


0 . 0 


3 . 8 


4 - 0 


DEX0448 027. nt. 6 


15846 . 0 


o o 
2 . o 


O Q 

2 . o 


D . 0 


0 - u 




6 . / 


n C\ 
\j . u 


u . u 


J . o 


"X ft 


DEX0448 027. nt. 6 


32136.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0,0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 6 


32137.0 


2.8 


2.8 


0.0 


0.0 


3 .7 


3.7 


0.0 


0.0 


3.8 


3.8 


DEX0448 033.nt.l 


22294 .0 


2.8 


2.8 


11.1 


11.1 


0.0 


0.0 


10.0 


10.0 


0.0 


0.0 


DEX0448 034.nt.l 


22545.0 


0-0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 034.nt.l 


22546.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 035.nt.l 


40309.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 043.nt.l 


22586.0 


8.3 


8.3 


0.0 


0.0 


11.1 


11.1 


0.0 


0.0 


11.5 


11.5 


DEX0448 043. nt. 2 


22585.0 


8.3 


8.3 


0.0 


0.0 


11.1 


11.1 


0.0 


0.0 


11.5 


11.5 


DEX0448 043. nt. 2 


22586.0 


8.3 


8.3 


0.0 


0.0 


11.1 


11.1 


0.0 


0.0 


11.5 


11.5 



Table 5. 



DEX ID 


Oligo 
Name 


Mam 

HER2 

up 

%up 

n=10 


Mam 

HER2 up 
%valid 
up n=10 


Mam 

NOT 

HER2 

up 

%up 

n=26 


Mam 

NOT 

HER2 

up % 

valid 

up 

n=26 


Mam 
ER 
up 
%up 
n=2 0 


Mam ER 
up % 
valid 
up 
n=20 


Mam 
NOT 
ER up 
%up 
n=16 


Mam 
NOT ER 
up % 
valid 
up 
n=16 


DEX0448 OlO.nt.l 


17869.0 


0.0 


0.0 


11.5 


11.5 


0.0 


0.0 


18.8 


18.8 


DEX0448_010.nt.l 


31466. '0 


0.0' 


0.0 


11.5 


11.5 


0.0 


0.0 


18.8 


18.8 


DEX0448 Oll.nt.l 


20711.0 


20.0 


20.0 


15.4 


15.4 


25.0 


25.0 


6.2 


6.2 
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T •? c 4 c n 

^ Z 0 D • Kj 


n n 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


riPYn4Aft Olfi nf- 1 




u . u 


D . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 , 0 


D . 0 


nT?Yn44ft 01 1 
JUCiAw^ ^ O V X O • ilu • ± 




0 0 


0 . 0 


3 . 8 


3 . 8 


0 . 0 


0 . 0 


5 . 2 


5 . 2 




22545 . 0 


n n 


0 . 0 


0 . 0 


3 , 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


nPYn4 4.fl mfi ^ 




n 0 


0 . 0 


0 . 0 


3 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


nPYri44R nifi nt- 


T c c A "5 n 


n 0 


0 . 0 


3 . 8 


3 . 8 


0 . 0 


0 . 0 


6 . 2 


6.2 


nrYnddw mfi nt- 

iJC(AU*±*xD \JXD > llU .J 


Z Z 3 1 3 . \J 


0 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 , 0 


0 . 0 


0 . 0 




0 9 c A r n 


n 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 , 0 


0 . 0 


0 . 0 






0 0 


0 . 0 


3 . 8 


3 . 8 


0 . 0 


0 . 0 


6.2 


6 . 2 




2Q057 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


nT7Yri4 4P ftl ft nh 1 


13245 . 0 


0 . 0 


0 . 0 


15 . 4 


15 . 4 


10.0 


10.0 


12 .5 


12 .5 




X J . u 


0 0 


0 0 


15 . 4 


15 . 4 


5 . 0 


5 . 0 


18 . 8 


18 . 8 


UC<AUfxrcO UXO.nL.X 


X J 0 V. u . u 


0 0 


0 . 0 


7 . 7 


7 . 7 


0 . 0 


0 . 0 


12 . 5 


12.5 


utiAU^^o uxo.nu«x 




0 0 


0 . 0 


15 . 4 


15 . 4 


5 . 0 


5 . 0 


18 . 8 


18 . 8 


nTTYAAAP mft 1 
UCiAU^^O uXO.Iit.X 


X.} J £^ . w 


10 0 


10 . 0 


15 .4 


15 .4 


5 . 0 


5 . 0 


25 . 0 


25 . 0 


nTrYn44Q mft i 


1 7777 0 


0 . 0 


0 . 0 


IS . 4 


15 . 4 


5 . 0 


5 . 0 


18 . 8 


18 . 8 


n"PY044fl ftlR Tit" 1 


13338 . 0 


0 . 0 


0 . 0 


7 . 7 


8 , 0 


0 . 0 


0 . 0 


12 .5 


12.5 


'n'CYn44fl 1 
iJiliA UnrtO U^^.ilU.X 


7 7 1 *^ 0 0 

0 ^ X U . V 


4 0.0 


40.0 


34 . 6 


34 . 6 


25 . 0 


25 . 0 


50 . 0 


50.0 


nFYn4 4fl nh T 

iJCjAU'l'xO .111- . X 


32151 . 0 


20.0 


20 . 0 


23 . 1 


23 . 1 


10 . 0 


10 . 0 


37.5 


37.5 


U&A *1 O U^^.IIL.*^ 


32150.0 


4 0.0 


40 . 0 


34 . 6 


34 . 6 


25 . 0 


25 . 0 


50.0 


50.0 


nT?Y04 4fl n'?^ •n^ 5 

XJIj A vrt^O v/<b^.iiU.^ 


32151 . 0 


20.0 


20 . 0 


23 . 1 


23 . 1 


10 . 0 


10 . 0 


37.5 


37.5 


nC>Yn44fi 0*5*7 1 


15310 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


nPYnd4fl 0^7 1 


15311 . 0 


0 . 0 


0.0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0 . 0 


0 . 0 


r»T?YA44P 0'?7 1 


15833 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


nPYn44fl n'?7 n1- 1 


1 R fl74 n 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


0.0 




15835 . 0 


0 . 0 


0 . 0 


0 , 0 


0 . 0 


0.0 


0 . 0 


0.0 


0.0 


ni7Yn4 4fl n^7 ni- 1 


X 3 0 J D . \J 


0 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0 . 0 


T>PY0AAQ n'DT -nh 1 


15845 . 0 


10.0 


10 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


6.2 


6.2 


ue<AU^4o / . ill- . X 


T C Q A Cl 
X D 0 ** D . V 


10 . 0 


10.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


6-2 


6.2 




.3 4i X J 0 . U 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


nPYn44P 077 1 
iJCiAU*t^O / . Ill- » X 


32 13 7 . 0 


10.0 


10.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


6.2 


6.2 


nT?Y04 4P 07 7 nt" 7 


15310 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0. 0 


0.0 


0.0 


■nPY044ft 077 n^ 7 


15311 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


nT?Y04 4A 0 77 nt" 7 


15633 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0. 0 


0 . 0 


0.0 


0.0 


nPY044P 077 T»^ 7 


15834 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0 . 0 


0 . 0 


0.0 


nPY044fl 077 nt" 7 


15835 . 0 


0 .0 


0 . 0 


0.0 


0 . 0 


0 , 0 


0 . 0 


0 .0 


0 . 0 


nPY044fl 077 nt- 7 


15836.0 


0 . 0 


0. 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


ni?Y044P n77 nt" 7 




10 . 0 


10.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


6-2 


6.2 


■nCVAAAQ AO"? r^i- '7 
L)£«AU44o UZ/.nu.Z 


X ^ 0 1 0 . U 


10.0 


10.0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


6.2 


6 . 2 


r>'PY044Q 0*57 nt" 9 


32136 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0 . 0 


0.0 


T^T7 Y riAAQ HOT ■nt' "5 


771 77 0 


10.0 


10 . 0 


0 . 0 


0 . 0 


0 . 0 


0. 0 


6.2 


6.2 


tmtyoaaq no*? nt- 
JJCt A U^^o UZ/.nu.J 


XS J X V • V 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


rit?YOAAQ no "7 nt- "5 


X 3 >^ X X . V 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0. 0 


0.0 


0.0 


nPYHAAfi n*?"? n^ 
UCiAv^^ 0 UZ/.Xl^.-3 


15833 . 0 


0 . 0 


0 , 0 


0 . 0 


0 . 0 


0 . 0 


0 , 0 


0.0 


0.0 


nT?Y044fi A*?*? nt- ^ 


1 ^fl74 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


T4CVrt>l>lO />*>*7 «^ 0 

U£iAU44o 02/>Zlu..5 


1 c;fi7Q 0 


0 0 


0 0 


0-0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 027. nt. 3 


15836.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0,0 


DEX0448 027. nt. 3 


15845.0 


10.0 


10.0 


0.0 


0.0 


0.0 


0.0 


6.2 


6.2 


DEX0448 027. nt. 3 


15846.0 


10.0 


10.0 


0.0 


0.0 


0.0 


0.0 


6.2 


6.2 


DEX0448 027. nt. 3 


32136.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 3 


32137.0 


10.0 


10.0 


0.0 


0.0 


0.0 


0.0 


6.2 


6.2 


DEX0448 027. nt. 4 


15310.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 4 


15311.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 4 


15833.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 4 


15834.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 4 


15835.0 


0.0 


0.0 


0.0 


0:0 • 


0.0 


0.0 


0.0 


0.0 


DEX044 8 027. nt. 4 


15836.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
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DEX0448 027. nt. 4 


32136 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX044 8 027. nt. 4 


32137 .0 


10 . 0 


10.0 


0.0 


0.0 


0.0 


0.0 


6.2 


6.2 


DEX044 8 027.nt.5 


15310 .0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX04 4 8 02 7.nt.5 


15311 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 5 


15833 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 5 


15834 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 5 


15835 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 5 


15836 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 5 


32136 .0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 5 


32137 .0 


10.0 


10.0 


0.0 


0.0 


0.0 


0.0 


6.2 


6.2 


DEX0448 027. nt. 6 


15310 . 0 


0.0 


0 . 0 


0 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027.nt.6 


15311 . 0 


0.0 


0 . 0 


0 . 0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt, 6 


15835 . 0 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 6 


15836 . 0 


0. 0 


0 . 0 


0 . 0 


0 . 0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 6 


15845.0 


10.0 


10.0 


0.0 


0.0 


0.0 


0.0 


6.2 


6.2 


DEX0448 027. nt. 6 


15846,0 


10.0 


10.0 


0.0 


0.0 


0.0 


0.0 


6.2 


6.2 


DEX0448 027. nt. 6 


32136.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 027. nt. 6 


32137.0 


10.0 


10.0 


0.0 


0.0 


0.0 


0.0 


6.2 


6 .2 


DEX0448 033.nt.l 


22294.0 


0.0 


0.0 


3.8 


3.8 


5.0 


5.0 


0.0 


0.0 


DEX044 8 034.nt.l 


22545.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 034.nt.l 


22546.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 035.nt.l 


40309.0 


0.0 


0,0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 043.nt.l 


22586.0 


0.0 


0.0 


11.5 


11.5 


5.0 


5.0 


12.5 


12.5 


DEX0448 043. nt. 2 


22585.0 


0.0 


0.0 


11.5 


11.5 


5.0 


5.0 


12.5 


12.5 


DEX0448 043. nt. 2 


22586.0 


0.0 


0.0 


11.5 


11.5 


5.0 


5.0 


12.5 


12.5 



Table 6. 



DEX ID 


Oligo 
Name 


Mam 
Multi- 
Cancer 
ALL %up 
n=20 


Mam 
Multi- 
Cancer 
ALL 

%valid 
uv n-20 


Mam 
Multi- 
Cancer 
STl %up 
n=9 


Mam 
Multi- 
Cancer 
STl 

%valid 
up n=9 


Mam 
Multi- 
Cancer 
ST2,3 
%up 
n=ll 


Mam 

Multi- 

Cancer 

ST2,3 

%valid 

up n=ll 


DEX0448 Oie.nt.l 


5354.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 OlS.nt.l 


1045.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 026.nt.l 


78479.0 


0.0 


0.0 


0.0 


0.0 


0,0 


0.0 


DEX0448 026.nt.l 


78479.1 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX044e 026. nt, 2 


78479.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 026. nt. 2 


78479.1 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028.nt.l 


5305.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028.nt.l 


5306.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028. nt. 2 


5305.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028. nt. 2 


5306.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 034.nt.l 


5354.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 044.nt.l 


42013.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 044,nt.l 


42013 .1 


0.0 


0.0 


0,0 


0,0 


0.0 


0.0 


DEX044 8 044.nt.l 


42013 .2 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 



LUNG CANCER CHIPS 
5 For lung cancer two different chip designs were evaluated with overlapping sets of 

a total of 29 samples, comparing the expression patterns of lung cancer derived polyA+ 
RNA to polyA+ RNA isolated from a pool of 12 normal lung tissues. For the Lung Array 
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Chip all 29 samples (15 squamous cell carcinomas and 14 adenocarcinomas including 14 
stage I and 1 5 stage Mil cancers) were analyzed and for the Multi-Cancer Array Chip a 
subset of 22 of these samples (10 squamous cell carcinomas, 12 adenocarcinomas) were 
assessed. 

5 The results for the statistically significant up-regulated genes on the Lung Array 

Chip are shown in Table 7. The results for the statistically significant up-regulated genes 
on the Multi-Cancer Array Chip are shown in Table 8. The first two columns of each 
table contain information about the sequence itself (DEX ID, Oligo Name), the next 
columns show the results obtained for all ("ALL") lung cancer samples, squamous cell 

10 carcinomas ("SQ"), adenocarcinomas ("AD"), or cancers corresponding to stage I 

("STl"), or stages II and III ("ST2,3"). *%up' indicates the percentage of all experiments 
in which up-regulation of at least 2-fold was observed (n=29 for Lung Array Chip, n=22 
for Multi-Cancer Array Chip), '%valid up' indicates the percentage of experiments with 
valid expression values in which up-regulation of at least 2-fold was observed. 

15 



Table 7. 



DEX ID 


Oligo 
Name 


Ling 
PiLL 
%up 
n=29 


Lng 
ALL % 
valid 
up 

n=29 


Lng 
SQ 
%up 
n=15 


Lng 
SQ % 
valid 
up 

n=15 


Lng 
AD 
%up 
n=14 


Lng 
AD % 
valid 
up 

n=14 


Lng 
STl 
%up 
n=14 


Lng 
STl % 
valid 
up 
n=14 


Lng 
ST2,3 
%up 
n=15 


Lng 

ST2,3 

% 

valid 

up 

n=15 


DEX0448 OlO.nt.l 


835.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 OlO.nt.l 


836.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 OlO.nt.l 


953.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 OlO.nt.l 


955.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 OlO.nt.l 


5749.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 Oie.nt.l 


5353.0 


13.8 


13.8 


20.0 


20.0 


7.1 


7.1 


14.3 


14.3 


13.3 


13 .3 


DEX0448 OlS.nt.l 


1044.0 


13.8 


13.8 


13,3 


13.3 


14.3 


14.3 


14.3 


14.3 


13.3 


13.3 


DEX0448 022.nt.l 


791.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 022.nt.l 


792.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 022.nt.l 


803.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 022.nt.l 


804.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 022.nt.l 


855.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 022. nt. 2 


791.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 022. nt. 2 


792.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 022. nt. 2 


803.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 022. nt. 2 


804.0 


0.0 


0.0 


0.0 


0.0 


0,0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 022. nt. 2 


855.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX044 8 028.nt.l 


1262.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0-0 


0.0 


DEX0448 028.nt.l 


1263 .0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028.nt.l 


1264.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX044 8 028.nt.l 


1265.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028.nt.l 


1320.0 


0.0 


0.0 


0.0 


o-.-o 


o-.o 


G-.O 


0.0 


0.0 


0.0 


0.0 


DEX0448 028. nt. 2 


1262.0 


0.0 


0.0 


0.0 


0.0 


0,0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028. nt. 2 


1263.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
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DSX0448 028, nt. 2 


1264.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028. nt. 2 


1265.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0,0 


0.0 


DEX0448 028. nt. 2 


1320.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


DEX0448_034.nt.l 


5353.0 


13.8 


13 .8 


20.0 


20.0 


7.1 


7.1 


14.3 


14.3 


13.3 


13 .3 


DEX0448 OSS.nt.l 


3729.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0 . 0 


DEX0448 040. nt. 2 


7434.0 


10.3 


11.5 


6.7 


7.1 


14 .3 


16.7 


14 .3 


16.7 


6.7 


7.1 


DEX0448 040.nt.2 


7435.0 


10.3 


12.5 


6.7 


8.3 


14.3 


16.7 


14.3 


15.4 


6.7 


9.1 


DEX044 8_04 0.nt.3 


7434.0 


10.3 


11.5 


6.7 


7.1 


14 .3 


16.7 


14.3 


16.7 


6.7 


7.1 


DEX0448 040. nt. 3 


7435.0 


10.3 


12.5 


6.7 


8.3 


14.3 


16.7 


14.3 


15.4 


6.7 


9.1 


DEX0448_040.nt .4 


7434.0 


10.3 


11.5 


6.7 


7.1 


14.3 


16.7 


14.3 


16.7 


6.7 


7.1 


DEX044B_040.nt .4 


7435.0 


10.3 


12.5 


6.7 


8.3 


14.3 


16.7 


14.3 


15.4 


6.7 


9.1 


DEX0448 040. nt. 5 


7434.0 


10.3 


11.5 


6.7 


7.1 


14.3 


16 .7 


14 .3 


16 .7 


6,7 


7 . 1 


DEX0448 040.nt.5 


7435.0 


10.3 


12.5 


6.7 


8.3 


14.3 


16.7 


14 .3 


15.4 


6.7 


9.1 


DEX0448 040.nt.6 


7434.0 


10.3 


11.5 


6,7 


7.1 


14.3 


16.7 


14.3 


16 .7 


6.7 


7.1 


DEX0448 040. nt. 6 


7435.0 


10.3 


12.5 


6.7 


8.3 


14.3 


16.7 


14.3 


15.4 


6.7 


9.1 


DEX0448 042.nt.l 


7412.0 


13 .8 


13 .8 


20.0 


20.0 


7.1 


7.1 


14.3 


14,3 


13.3 


13.3 


DEX0448 042,nt.l 


7413.0 


17.2 


17.2 


20.0 


20.0 


14 .3 


14,3 


21.4 


21.4 


13.3 


13 .3 


DEX0448 043. nt. 2 


939.0 


3.4 


4.0 


0.0 


0.0 


7.1 


7.7 


0.0 


0.0 


6.7 


6.7 


DEX0448 043. nt. 2 


940.0 


3.4 


3 .4 


0.0 


0.0 


7.1 


7.1 


0.0 


0.0 


6.7 


6.7 


DEX0448 043. nt. 2 


943.0 


3.4 


3 .4 


0.0 


0.0 


7.1 


7.1 


0.0 


0.0 


6.7 


6.7 


DEX0448 043. nt. 2 


944 . 0 


3.4 


3.4 


0.0 


0.0 


7.1 


7.1 


0.0 


0.0 


6.7 


6.7 


DEX0448 043. nt. 2 


946.0 


3.4 


3 .4 


0.0 


0.0 


7.1 


7.1 


0.0 


0.0 


6,7 


6.7 


DEX0448 043. nt. 2 


960.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0,0 


0.0 


DEX0448 043. nt. 2 


966.0 


3.4 


3.4 


0.0 


0.0 


7.1 


7.1 


0.0 


0.0 


6.7 


6.7 


DEX044 8 044.nt.l 


42013.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 



Table 8. 



DEX ID 


Oligo 
Name 


Lng 
Multi- 
Cancer 
ALL %up 
n=22 


Lng 
Multi- 
Cancer 
ALL 
%valid 
up n=22 


Lng 
Multi- 
Cancer 
SQ %up 
n=10 


Lng 
Multi- 
Cancer 
SQ 

% valid 
up n=10 


Lng 
Multi- 
Cancer 
AD %up 
n=12 


Lng 
Multi- 
Cancer 
AD 

% valid 
up n=12 




DEX0448 Oie.nt.l 


5354.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 019.nt.l 


1045.0 


4.5 


4.5 


0.0 


0.0 


8.3 


8.3 


DEX044B 026.nt.l 


78479.0 


31.8 


36.8 


10.0 


12.5 


50.0 


54.5 


DEX0448 026.nt.l 


78479.1 


36.4 


36.4 


10.0 


10. 0 


58.3 


58.3 


DEX0448 026. nt. 2 


78479.0 


31.8 


36.8 


10.0 


12.5 


50.0 


54.5 


DEX0448 026. nt. 2 


78479.1 


36.4 


36.4 


10,0 


10.0 


58.3 


58.3 


DEX0448 028.nt.l 


5305.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028.nt.l 


5306.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028. nt. 2 


5305-0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028. nt. 2 


5306.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 034.nt.l 


5354.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 044.nt.l 


42013.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 044.nt.l 


42013.1 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 044.nt.l 


42013.2 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


nil 



OVARIAN CANCER CfflPS 
5 For ovarian cancer two different chip designs were evaluated with overlapping sets 

of a total of 19 samples, comparing the expression patterns of ovarian cancer derived total 
RNA to total RNA isolated from a pool of 9 normal ovarian tissues. For the Multi-Cancer 
Array Chip, all 19 samples (14 invasive carcinomas, 5 low malignant potential samples 
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were analyzed and for the Ovarian Array Chip, a subset of 17 of these samples (13 
invasive carcinomas, 4 low malignant potential samples) were assessed. 

The results for the statistically significant up-regulated genes on the Ovarian Array 
Chip are shown in Table 9. The results for the Multi-Cancer Array Chip are shown in 
5 Table 10. The first two columns of each table contain information about the sequence 
itself (DEX ID, Oligo Name), the next columns show the results obtained for all ("ALL") 
ovarian cancer samples, invasive carcinomas ("INV") and low malignant potential 
("LMP") samples. '%up' indicates the percentage of all experiments in which up- 
regulation of at least 2-fold was observed (n=19 for the Multi-Cancer Array Chip, n=17 
10 for the Ovarian Array Chip), '%valid up' indicates the percentage of experiments with 
valid expression values in which up-regulation of at least 2-fold was observed. 



Table 9. 



DEX ID 


Oligo 


Ovr 
ALL 
%up 
n=17 


Ovr ALL 
%valid up 
n«17 


Ovr 
INV 
%up 
n=13 


Ovr INV 
%valid up 
n=13 


Ovr 
LMP 
%up 
n=4 


Ovr LMP 
%valid up 
n=4 


DEX0448 OlO.nt.l 


9860.01 


17.6 


21.4 


15-4 


18.2 


25.0 


33.3 


DEX0448 OlO.nt.l 


9880.02 


17.6 


18.8 


15.4 


16.7 


25.0 


25.0 


DEX0448 Oll.nt.l 


22483.01 


11.8 


11.6 


7.7 


7.7 


25.0 


25.0 


DEX0448 Oll.nt.l 


22483,02 


11.8 


11.8 


7.7 


7.7 


25.0 


25.0 


DEX0448 Oie.nt.l 


21617.01 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 Ol6.nt.l 


21617.02 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 Oie.nt.l 


21619.01 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 Oie.nt.l 


21619.02 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 2 


21613.01 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 2 


21613.02 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 2 


21617.01 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 2 


21617.02 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 2 


21619.01 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 2 


21619.02 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 3 


21617 .01 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 3 


21617.02 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 3 


21619.01 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 016. nt. 3 


21619.02 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 034.nt.l 


21613.01 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 034.nt.l 


21613.02 


0.0 


0.0 


0.0 


0.0 


0,0 


0.0 


DEX0448 034.nt.l 


21619.01 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 034.nt.l 


21619.02 


0.0 


0.0 


0.0 


0.0 lo.o 


0.0 



15 Table 10. 



DEX ID 


Oligo 
Name 


Ovr 

Multi- 
Cancer 
ALL %up 
n=19 


Ovr 
Multi- 
Cancer 
ALL 
%vali(i 
up n=19 


Ovr 
Multi- 
Cancer 
INV %up 
n=14 


Ovr 
Multi- 
Cancer 

INV 

%v9lid. 
up n=14 


Ovr 
Multi- 
Cancer 
LMP %up 
n=5 


Ovr 
Multi- 
Cancer 
LMP 
%valid 
up n=5 


DEX0448 Oie.nt.l 


5354.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 
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DEX0448 OlS.nt.l 


1045. 0 


0.0 


0.0 


0 . 0 


0 , 0 


0.0 |o . 0 


DEX0448 026.nt.l 


78479.0 


0.0 


0.0 


0 . 0 


0 . 0 


0.0 |0 . 0 




/OH / y . J. 


f\ ft 
u . u 


u . u 


0 . 0 


0 . 0 


0.0 |o . 0 


DEX0448 026. nt, 2 


78479.0 


0.0 


0.0 


0.0 


0.0 


0.0 0.0 


DEX0448 026. nt. 2 


78479.1 


0.0 


0.0 


0.0 


0.0 


0.0 p . 0 


DEX0448 028.nt.l 


5305.0 


5.3 


5.6 


7.1 


7-7 


0.0 p . 0 


DEX0448 028.nt.l 


5306-0 


5.3 


5.6 


7.1 


7.1 


0.0 p . 0 


DEX0448 028. nt. 2 


5305.0 


5.3 


5.6 


7.1 


7.7 


0.0 |0 . 0 


DEX0448 028. nt. 2 


5306.0 


5.3 


5.6 


7.1 


7.1 


0.0 lo.o 


DEX0448 034 .nt .1 


5354.0 


0.0 


0.0 


0.0 


0.0 


0.0 p.o 


DEX0448 044.nt.l 


42013 .0 


0.0 


0.0 


0.0 


0.0 


0.0 0.0 


DEX0448 044.nt.l 


42013.1 


0.0 


0.0 


0.0 


0.0 


0.0 0.0 


DEX0448 044.nt,l 


42013.2 


0.0 


0.0 


0.0 


0.0 


0.0 lo.o 



PROSTATE CANCER 

For prostate cancer three different chip designs were evaluated with overlapping 
sets of a total of 29 samples, comparing the expression patterns of prostate cancer or 

5 benign disease derived total RNA to total RNA isolated from a pool of 35 normal prostate 
tissues. For the Prostate 1 Array and Prostate 2 Array Chips all 29 samples (17 prostate 
cancer samples 12 non-malignant disease samples) were analyzed. For the Multi-Cancer 
Anray Chip a subset of 28 of these samples (16 prostate cancer samples, 12 non-malignant 
disease samples) were analyzed. 

10 The results for the statistically significant up-regulated genes on the Prostatel 

Array Chip and Prostate2 Array Chip are shown in Table 1 1. The results for the 
statistically significant up-regulated genes on the Multi-Cancer Array Chip are shown in 
Table 12. The first two columns of each table contain information about the sequence 
itself (DEX ID, Oligo Name), the next columns show the results obtained for prostate 

1 5 cancer samples ("CAN") or non-malignant disease samples ("DIS"). '%up' indicates the 
percentage of all experiments in which up-regulation of at least 2-fold was observed (n=29 
for the Prostate2 Array Chip and the Multi-Cancer Array Chip), •%valid up' indicates the 
percentage of experiments with valid expression values in which up-regulation of at least 
2-fold was observed. 

20 

Table 11. 



DEX ID 


Oligo 
Name 


Pro CAN 
%up nsl7 


Pro CAN 
%valid up 
n=17 


Pro DIS . 
%up n«12 


Pro2 DIS 
%valid up 
n=12 


DEX0448 OlO.nt.l 


38723.02 


0.0 


0.0 


0.0 


0.0 


DEX0448 OlO.nt.l 


38723.03 


0.0 


0.0 


0.0 


0.0 


DEX0448 014.nt.l 


26889.01 


0.0 


0.0 


0.0 


0.0 


DEX0448 014.nt.l 


26889.02 


0.0 


0.0 


0.0 


0.0 


DEX0448 014.nt.l 


27099.01 


0.0 


0.0 


0.0 


0.0 
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DEX0448 014.nt.l 


27099. 02 


0.0 


0.0 


0.0 


0.0 


DEX0448 OlB.nt.l 


32638.01 


0.0 


0 . 0 


0.0 


0. 0 


DEX0448 Oie.nt.l 


32638 .02 


0.0 


0 . 0 


0.0 


0 . 0 


DEX0448_018 .nt . 1 


32638. 03 


0.0 


0 . 0 


8.3 


11.1 


DEX044 8 022.nt.l 


29143 . 01 


11.8 


11 . 8 


8.3 


8 . 3 


DEX044 8 022.nt.l 


29143 . 02 


5.9 


5.9 


8.3 


9 . 1 


DEX0448 022.nt.l 


29143 . 03 


5.9 


5.9 


8 . 3 


8 . 3 


DEX0448 022. nt. 2 


29143 . 01 


11.8 


11 . 8 


8 . 3 


8.3 


DEX0448 022. nt. 2 


29143 . 02 


5.9 


5.9 


8 . 3 


9.1 


DEX0448 022. nt. 2 


29143 . 03 


5.9 


5.9 


8.3 


8.3 


DEX044e 041. nt. a 


28451 . 01 


0.0 


0 . 0 


0 . 0 


0.0 


DEX0448 041.nt.l 


28451 . 02 


0 . 0 


0 . 0 


0 . 0 


0 . 0 


DEX0448 041.nt.l 


32716 . 01 


0 . 0 


0. 0 


0 . 0 


0. 0 


DEX044B 041.nt.l 


32716 . 02 


0 . 0 


0 . 0 


0 . 0 


0. 0 


DEX0448 041.nt.l 


32716 . 03 


0.0 


0.0 


8.3 


14 .3 


DEX0448 041.nt.l 


32718 . 01 


0.0 


0.0 


0 . 0 


0.0 


DEX0448 041.nt.l 


32718.02 


0.0 


0.0 


0.0 


0.0 


DEX044 8 041.nt.l 


32718.03 


0.0 


0.0 


0.0 


0.0 


DEX0448 041.nt.l 


32720.01 


0.0 


0.0 


0.0 


0.0 


DEX0448 041.nt.l 


32720.02 


0.0 


0.0 


0.0 


0.0 


DEX0448 041.nt.l 


32720.03 


0.0 


0.0 


0.0 


0.0 


DEX0448 041.nt.l 


32724 . 01 


0.0 


0.0 


0.0 


0.0 


DEX0448 041.nt.l 


32724.02 


0.0 


0.0 


0.0 


0.0 


DEX0448 041.nt.l 


32724.03 


0.0 


0.0 


0.0 


0.0 


DEX0448 041. nt. 2 


32716.01 


0.0 


0.0 


0.0 


0.0 


DEX0448 041. nt. 2 


32716.02 


0.0 


0.0 


0.0 


0.0 


DEX044 8 041.nt.2 


32716.03 


0.0 


0.0 


8.3 


14.3 



Table 12, 



DEX ID 


Oligo 
Name 


Pro Multi- 
Cancer CAN 
%up n=16 


Pro Multi- 
Cancer CAN 
%valid up 
n=16 


Pro Multi- 
Cancer DIS 
%up n=12 


Pro Multi- 
Cancer DIS 
%valid up 
n=12 


DEX0448 Oie.nt.l 


5354.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 019.nt.l 


1045.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 026.nt.l 


78479 . 0 


0.0 


0.0 


0.0 


0.0 


DEX0448 026.nt.l 


78479. 1 


0-0 


0.0 


0.0 


0.0 


DEX0448 026. nt. 2 


78479.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 026. nt. 2 


78479.1 


0.0 


0.0 


0.0 


0.0 


DEX0448 028.nt.l 


5305.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028.nt.l 


5306.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028. nt. 2 


5305.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 028. nt. 2 


5306.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 034.nt.l 


5354.0 


0.0 


0.0 


0.0 


0.0 


DEX0448 044.nt.l 


42013.0 


0.0 


0,0 


0.0 


0.0 


DEX0448 044.nt.l 


42013.1 


0.0 


0.0 


0.0 


0.0 


DEX0448 044.nt.l 


42013.2 


0.0 


0.0 


0.0 


0,0 



SEQ ID NO: 1-95 was up-regulated on various tissue microarrays. Accordingly, 
5 nucleotide SEQ ID NO: 1 -95 or the encoded protein SEQ ID NO: 96-237 may be used as 
a cancer therapeutic or diagnostic target for the tissues in which expression is shown. 
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The following table lists the location (Oligo Location) where the microarray oligos 
(Oligo ID) map on the transcripts (DEX ID) of the present invention. Each Oligo ID may 
have been printed multiple times on a single chip as replicates. The Oligo Name is an 
exemplary replicate (e.g. 1000.01) for the Oligo ID (e.g. 1000), and data from other 
5 replicates (e.g. 1 000.02, 1 000.03) may be reported. Additionally, the Array (Chip Name) 
that each oligo and oligo replicates were printed on is included. 



DEX ID 


Oligo 
±u 


Oligo 


Chip Kame 


Oligo 
Location 


U C» A \J*tH O \J \J 1. , iX\, , ± 


J *i 2? *4 U 




jColon array 


a - 4o J 


DT^ynA&R CiOO nh 1 


•3 Q QC*? 


39957.0 


jcolon array 


172-231 




"a QQCQ 


39958.0 


Icolon array 


l4s4 -Id J 


UCi.^\J*i*i O VUJ.I1L..J. 


/ 


32057.0 


|Colon array 


CIO C "7 Q 




41^10 


41210.0 


[colon array 


424-483 


TWVnAAQ Anc o 


J 0^4 J 


36243.0 J 


(colon array 


1134-1193 




DAT fi 


8410.0 ^ 


jColon array 


T T ICQ 
1 1 U - 1 




^ 0 % ^ J 


26449.0 


jColon array 


4 J X -4 y U 


urtAU^^o uuo.nu.l 


Q Q C 1 


32851.0 ^ 


[colon array 


A Q C 0 

209- Joo 




4UoU7 


40809.0 


[colon array 


± 07 J • lyz>4i 




■3 ii r 0 n 
J ft 0 0 u 


34680.0 


[colon array 


Q'i A Q 0*3 

0 J4 -07 J 




4 UolU 


40810.0 


jColon array 


IOC'S 101*5 


opv^^yiiiQ nn n ■•-it- t 




28423.0 


IColon array 


49-108 


uciAU44o u±u.nu*x 




953.0 


|Lung array 


24 J -J 02 


DbA044o OiO.nt.l 


956 


956.0 


fMulti -Cancer array 


45-104 


DEa0448 OiO.nt.l 


17869 


17869.0 


[Breast array 


44-103 


DEA0448 OiO.nt.l 


955 


955.0 


(Lung array 


50-109 


JJt«AU44o UlU.nt.l 


9880 


9880.01 Jjovarian array 


49-108 


UciAU^fto Ui.U.nt.i 


^ Q "7 T "3 


38723.01 lProstate2 array 


50-109 


UrjAU44D UlU.nt.l 


635 


835.0 


[Lung array 


403-462 


DEX0448 OiO.nt.l 


5749 


5749.0 


jLung array 


1038'-1097 


DEX0448 OiO.nt.l 


31466 


31466.0 


[Breast array 


413-472 


DEX0448 OiO.nt.l 


836 


336.0 


[Lung array 


383-442 


DEX0448 Oll.nt.l 


22483 


22483.02 


[Ovarian array 


164-223 


DEX0448 Oll.nt.l 


20711 


20711.0 


[Breast array 


164-223 


DEX0448 Oll.nt.l 


8312 


8312.0 


[colon array 


245-304 


DEX0448 013.nt.l 


16006 


16006.0 


jColon array 


413-472 


DEX0448 014.nt.l 


27099 


27099.01 


jprostatel array 


257-316 


DEX0448 014.nt.l 


37378 


37378.0 


[colon array 


137-196 


DEX0448 014.nt.l 


37376 


37376.0 


[Colon array 


137-196 1 


DEX0448 014.nt.l 


26889 


26889.02 


jProstatel array 


170-229 1 


DEX0448 OlS.nt.l 


38996 


38996.0 


[colon array 


907-966 


DEX044 8 OlS.nt.l 


33348 


33348.0 


[colon array 


476-535 


DEX0448 Oie.nt.l 


22545 


22545.0 


[Breast array 


444-503 


DEX0448 OlS.nt.l 


26543 


26543.0 


[Breast array 


202-261 


DEX0448 Oie.nt.l 


21617 


21617.01 


|c5varian array 


207-266 


DEX0448 Oie.nt.l 


5354 


5354.0 


jMulti-Cancer array 


434-493 
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DEX0448 016.nt.l 


22546 1 


22546.0 j 


Breast array 


414-473 


DEX0448 Oie.nt.l 


21619 


21619.01 J 


Ovarian array 


295-354 


DEX0448 Oie.nt.l 


5353 


5353.0 1 


Lung array 


445-504 


DEX0448 016. nt. 2 


26543 


26543.0 J 


Breast array 


439-498 


DEX0448 016. nt. 2 


22545 1 


22545.0 ] 


Breast array 


681-740 


DEX0448 016. nt. 2 


21619 1 


21619.01 J 


Ovarian array 


532-591 


DEX0448 016. nt. 2 


21613 1 


21613.02 J 


Ovarian array 


815-874 


DEX0448 016. nt. 2 


22546 1 


22546.0 J 


Breast array 


651-710 


DEX0448 016. nt. 2 


5354 1 


5354.0 J 


Multi -Cancer array 


671-730 


DEX0448 016. nt. 2 


21617 1 


21617.01 J 


Ovarian array 


444-503 


DEX0448 016. nt. 2 


5353 


5353 .0 


Lung array 


682-741 


DEX0448 016. nt. 3 


26543 


,26543-0 


Breast array 


68-127 


DEX0448 016. nt. 3 


22545 


22545.0 


Breast array 


[310-369 


DEX0448 016. nt. 3 


21617 


21617.01 


Ovarian array 


73-132 1 


DEX0448 016. nt. 3 


5353 


5353.0 


Lung array 


311-370 1 


DEX0448 016. nt. 3 


22546 


22546.0 


Breast array 


280-339 1 


DEX0448 016. nt. 3 


5354 1 


5354.0 


Multi -Cancer array 


300-359 1 


DEX0448 016. nt. 3 


21619 _| 


21619.01 


Ovarian array 


161-220 1 


DEX0448 017.nt.l 


37895 _| 


37895.0 


Colon array 


863-922 1 


DEX0448 017.nt.l 


37227 _j 


37227 .0 


Colon array 


724-783 1 


DEX0448 017.nt.l 


37239 1 


37239.0 


Colon array 


319-378 1 


DEX0448 017.nt.l 


29052 1 


29052.0 


Breast array 


871-930 1 


DEX0448 017.nt.l 


36160 


36180 .0 


Colon array' 


551-610 1 


DEX0448 017.nt.l 


36164 


36184.0 


Colon array 


218-277 


DEX0448 017.nt.l 


36162 


36182.0 


Colon array 


724-783 


DEX0448 017.nt.l 


36179 


36179.0 


Colon array 


592-651 


DEX0448 017.nt.l 


37228 


37228.0 


Colon array 


694-753 


DEX0448 017.nt.l 


36181 


36181.0 


Colon array 


852-911 


DEX0448 017.nt.l 


37240 


37240.0 


Colon array 


299-358 


DEX0448 018.nt.l 


13323 


13323 . 0 


Breast array 


902-961 


DEX0448 018.nt.l 


13245 


13245.0 


Breast array 


1209-1268 


DEX0448 Oie.nt.l 


13337 


13337.0 


Breast array 


1195-1254 


DEX0448 018.nt.l 


13299 


13299.0 


Breast array 


853-912 


DEX0448 018.nt.l 


13338 


13338.0 


Breast array 


1165-1224 


DEX0448 Oie.nt.l 


41823 


41823.0 


Colon array 


544-603 


DEX0448 OlS.nt.l 


41824 


41824.0 


Colon array 


495-554 


DEX0448 Oie.nt.l 


13320 


13320.0 


Breast array 


1147-1206 


DEX0448 Oie.nt.l 


32638 


32636.03 


Prostate2 array 


897-956 


DEX0448 Oie.nt.l 


13321 


13321.0 


Breast array 


885-944 


DEX044e OlS.nt.l 


1045 


1045.0 


Multi -Cancer array 


144-203 


DEX0448 019.nt.l 


1044 


1044.0 


iLung array 


154-213 


DEX044e 020.nt.l 


39656 


139656.0 


Icolon array 


1033-1092 


DEX044e 020.nt.l 


|39655 


139655.0 


Icolon array 


1201-1260 


D£X044e 021.nt.l 


135218 


]35218.0 


Icolon array 


1621-1680 


DEX0448 021.nt.l 


|31155 


131155.0 


[Colon array 


892-951 


'DEX044e 021.nt.l 


|30870 ||30870.0 


IColon array 


1621-1680 


DEX0448 021.nt.l 


31146 1131146. 0 


[colon array 


603-662 
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30819 


30819 . 0 


Colon array 


1621-1680 


jUc.AU^4o_ 021 . nt . 1 


31157 


31157 . 0 


Colon array 


1611-1670 


juxLAUft^ o_u^i .nc . 1 


31154 


31154 , 0 


Colon array 


1364-1423 


|-Uc#AU44a 021.nt.l 


30931 


30931 . 0 


Colon array 


653-712 


iUtAU44o 021.nt.2 


30870 


30870 . 0 


Colon array 


1926-1985 


JJriAU44o 021. nt. 2 


31154 


31154 . 0 


Colon array 


1669-1728 


lJ£.AU44o 021. nt. 2 


311S7 


31157 . 0 


Colon array 


1916-1975 


iJJiA044o 021. nt. 2 


30819 


30819 . 0 


Colon array 


1926-1985 


UbAU44o 021. nt. 2 


31146 


31146 . 0 


Colon array 


603-662 


iJCjAU44Q 021. nt. 2 


30931 


30931 . 0 


Colon array 


653-712 


iJ£4AU44o u^i.nc.2 


31155 


31155 . 0 


Colon array 


892-951 


iJaAU44o U^l.nC.2 


35218 


35218 , 0 


Colon array 


1926-1985 


TM?Yf>AAQ ATT T 
UCiAU4 4 D 021. nt, 3 


31154 


31154 . 0 


Colon array 


207-266 


JJiiAU44o 021. nt. 3 


35218 


35218 .0 


Colon array 


464-523 


iJhiAU4l4o U21.nt.3 


30819 


30819. 0 


Colon array 


464-523 


UiLAU44o 021. nt. 3 


31157 


31157 . 0 


Colon array 


454-513 


O£.A044a 021. nt. 3 


30870 


30870.0 


Colon array 


464-523 


DEA0448 022.nt,l 


37415 


37415 . 0 


Colon array 


452-511 


T^f Y f\A A O AO'S T 

UCiA 04 4 o 02 2 • nt . 1 


792 


792 . 0 


Lung array 


452-511 


UcjAU44t5 022.nt.l 


36878 


36878 . 0 


Colon array 


502-561 


UriAU44o 022.nC.l 


791 


791 . 0 


Lung array 


457-516 


Uii A U44d 022,nt.l 


855 


855 . 0 


Lung array 


285-344 


iJc.A044o 022.nt.l 


804 


804 . 0 


Lung array 


438-497 


T^TVA/! /I O A10 M ^ ^ 

iJbA044o 022.nt.l 


32151 


32151 . 0 


Breast array 


163-222 


Uc.a044o 022.nt.l 


803 


803.0 


Lung array 


441-500 


UC.A044O 022.nt.l 


29143 


29143 . 01 ■ 


Prostate2 array 


259-318 


DEA0448 022.nt.l 


32150 


32150. 0 


Breast array 


203-262 


Ur.A044o 022. nt. 2 


804 


804 . 0 


Lung array 


333-392 


l^C Y r^A A Q AOn 

UCiA044o 022. nt. 2 


37415 


37415 . 0 


Colon array 


347-406 


UiiA044o 022. nt. 2 


791 


791 . 0 


Lung array | 


352-411 


L'bA044o 022. nt. 2 


32151 


32151 . 0 


Breast array | 


58-117 


Ur«A044o 022. nt. 2 


36878 


36878 .0 


Colon array 1 


397-456 


DbA044o 022. nt. 2 


792 


792.0 


Lung array | 


347-406 


DEX0448 022. nt. 2 


29143 


29143 . 01 


Prostate2 array | 


154-213 


DEA0448 022. nt. 2 


803 


803.0 


Lung array | 


336-395 1 


DEX0448 022. nt. 2 


855 


855.0 


Lung array | 


180-239 _J 


DEX0448 023.nt.l 


22297 


22297.0 


Colon array | 


1117-1176_| 


DEX0448 024.nt.l 


19607 


19607.0 


Colon array j 


199-258 1 


DEX0448 025.nt.l 


40034 


40034 . 0 


Colon array ( 


583-642 _J 


DEX0448 025.nt.l 


40033 


4 0033 . 0 


UD-ltJiJ al^JLcLy I 


•7 0 il QA'i 1 
/ 0^ - o4 J [ 


DEX0448 026.nt.l 


78479 


78479.0 


Multi -Cancer array | 


1695-1754 1 


DEX0448 026.nt.l 


41284 


41264.0 


Colon array | 


2169-2228 | 


DEX0448 026.nt .1 | 


41210 


41210.0 


Colon array | 


1892-1951 


DEX0448 026. nt. 2 | 


78479 


78479.0 


Multi -Cancer array | 


1071-1130 


DEX0448 026. nt. 2 | 


41284 


41284 .0 


Colon array | 


1545-1604 


DEX0448 027.nt.i 


15836 


15836.0 


Breast array | 


5393-5452. 


DEX0448_027.nt.l | 


32137 1 


32137.0 


Breast array | 


5280-5339 
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DEX04 4 8 027.nt.l 


15311 


|15311.0 llBreast array | 


4539-4598 


DEX0448 027.nt.l 


15846 


|15846.0 llBreast array | 


5624-5683 


DEX044 8 027.nt.l 


15845 


|15845.0 llBreast array | 


5871-5930 


DEX0448 027.nt.l 


15310 


|15310.0 [(Breast array 1 


4588-4647 


DEX0448 027.iit.l 


32136 


|32136.0 llBreast array | 


5363-5422 


DEX0448 027.nt.l 


15834 


|15834.0 llBreast array | 


3574-3633 


DEX0448 027.nt.l 


9623 


|9623 .0 


|Colon array ! 


5423-5482 


DEX0448 027.nt.l 


15835 


|15835.0 


{Breast array | 


5423-5482 


DEX0448 027.nt.l 


9622 


19622.0 


IColon array | 


5527-5586 


DEX0448 027.nt.l 


15833 


|15833.0 


iBreast array | 


3594-3653 


DEX0448 027. nt. 2 


15845 


15845.0 


Breast array | 


5739-5798 


DEX0448 027. nt. 2 


15833 


15833 . 0 


Breast array | 


3462-3521 


DEX0448 027. nt. 2 


9622 


9622.0 


Colon array j 


5395-5454 


DEX0448 027. nt. 2 


15311 


15311.0 


Breast array | 


4407-4466 


DEX0448 027. nt. 2 


15846 


15846.0 


Breast array ) 


5492-5551 


DEX044B 027. nt. 2 


15836 


15836.0 


Breast array j 


5261-5320 


DEX0448 027. nt. 2 


15310 


15310.0 


Breast array j 


4456-4515 


DEX0448 027. nt. 2 


15835 


15835.0 


Breast array J 


5291-5350 


DEX0448 027. nt. 2 


32136 


32136.0 


Breast array | 


5231-5290 


DEX0448 027. nt. 2 


9623 


9623.0 


Colon array | 


5291-5350 


DEX0448 027. nt. 2 


15834 


15834 .0 


Breast array J 


3442-3501 


DEX0448 027. nt. 2 


32137 


32137.0 


Breast array | 


5148-5207 


DEX0448 027. nt. 3 


15836 


15836.0 


Breast array | 


7132-7191 


DEX0448 027. nt. 3 


15833 


15833.0 


Breast array | 


4013-4072 


DEX0448 027. nt. 3 


9622 


9622.0 


Colon array | 


7266-7325 


DEX044 8 027.nt.3 


32137 


32137,0 


Breast array | 


7019-7078 


DEX0448 027. nt. 3 


9623 


9623.0 


Colon array | 


7162-7221 


DEX0448 027. nt. 3 


15835 


15835.0 


Breast array | 


7162-7221 


DEX0448 027. nt. 3 


32136 


32136.0 


Breast array | 


7102-7161 


DEX0448 027. nt. 3 


15310 


15310.0 


Breast array _) 


6327-6386 


DEX0448 027. nt. 3 


15834 


15834.0 


Breast array | 


3993-4052 


DEX044 8 027. nt. 3 


15846 


15846.0 


Breast array J 


7363-7422 


DEX044B 027. nt. 3 


15311 


15311.0 


Breast array | 


6278-6337 


DEX0448 027. nt. 3 


15845 


15845.0 


Breast array | 


7610-7669 


DEX044B 027. nt. 4 


15836 


15836.0 


Breast array | 


5124-5183 


|DEX0448_027.nt.4 


15311 


15311.0 


Breast array | 


4270-4329 


DEX0448 027. nt. 4 


32136 


32136.0 


Breast array | 


5094-5153 


DEX0448 027. nt. 4 


15833 


15833.0 


Breast array J 


3325-3384 


DEX0448 027. nt. 4 


15834 


15834.0 


Breast array _| 


3305-3364 


DEX044S 027. nt. 4 


9623 


9623 .0 


Colon array J 


5154-5213 


DEX0448 027. nt. 4 


15835 


15835.0 


Breast array | 


5154-5213 


DEX0448 027. nt. 4 


9622 


9622.0 


Colon array j 


5258-5317 


DEX0448 027. nt. 4 


32137 


32137.0 


Breast array | 


5011-5070 


DEX0448 027. nt. 4 


15310 


15310.0 


Breast array j 


4319-4378 


DEX044 8 027. nt. 5 


15835 


15835.0 


Breast array | 


5154-5213 


DEX0448 027. nt. 5 


9623 


9623.0 


Colon array | 


5154-5213 


DEX0448 027. nt. 5 


15836 


15836.0 


Breast array | 


5124-5183 
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|DEX0448_027.nt .5 


15834 


15834.0 


Breast array 


3305-3364 


|DEX0448_027.nt .5 


9622 


9622,0 


Colon array 


5258-5317 


|DEX0448_02 7.nt.5 


15833 


15833.0 


Breast array 


3325-3384 


|DEX0448 027. nt. 5 


32136 


32136.0 


Breast array 


5094-5153 


|DEX0448 027. nt. 5 


15311 


15311.0 


Breast array 


4270-4329 


|DEX044 8_027.nt.5 


32137 


32137.0 


Breast array 


5011-5070 


[PEX0448 027. nt. 6 


15846 


15846.0 


Breast array 


1797-1856 


fDEX044 8_027.nt .6 


15835 


15835,0 


Breast array 


1596-1655 


|dEX0448 027. nt. 6 


9622 


9622.0 


Colon array 


1700-1759 


[DEX0448 027. nt. 6 


15845 


15845.0 


Breast array 


2044-2103 


|DEX0448 027. nt .6 


9623 


9623 .0 


Colon array 


1596-1655 


|DEX0448 027. nt. 6 


15310 


15310.0 


Breast array 


761-820 


|DEX044 8_027.nt .6 


15836 


15836.0 


Breast array 


1566-1625 


|DEX0448_027.nt .6 


32137 


32137.0 


iBreast array 


1453-1512 


|dEX0448 027. nt. 6 


32136 


32136.0 


[Breast array 


1536-1595 


|DEX0448_027.nt.6 


15311 


15311.0 


Breast array 


712-771 


|DEX0448 028.nt.l 


5305 


5305.0 


Multi -Cancer array 


679-738 


jpEX0448_028.nt.l 


1263 


1263.0 


Lung array 


653-712 


|DEX0448_028.nt.l 


1265 


1265,0 


Lung array 


648-707 


|DEX04 4 8_02 8.nt.l 


1262 


1262.0 


Lung array 


674-733 


|pEX0448_028.nt.l 


5306 


5306.0 


Multi -Cancer array 


648-707 


|DEX0448_028.nt.a 


1320 


1320.0 


|Lung array 


988-1047 


^X0448_028.nt.l 


1264 


1264.0 


|Lung array 


679-738 


|dEX0448 028. nt. 2 


1262 


1262.0 


[Lung array 


423-482 


lDEX044 8_028.nt.2 


1264 


1264.0 


jLung array 


428-487 


[dEX0448_028 .nt.2 


5305 


5305.0 


(Multi -Cancer array 


428-487 


|DEX0448_028.nt.2 


1263 


1263.0 


|Lung array 


402-461 


[DEX0448_028.nt.2 


1320 


1320.0 


[Lung array 


812-871 


[DEX0448 028. nt.2 


1265 


1265.0 


|Lung array 


397-456 


^EX0448_028.nt.2 


5306 


5306.0 


|Multi -Cancer array 


397-456 


|DEX0448_029.nt.l 


38381 


38381.0 


IColon array 


1892-1951 


!DEX0448_029.nt.l 


28733 


28733.0 


IColon array 


2187-2246 


[DEX0448_029-nt.l 


38383 


38383.0 


Icolon array 


2546-2605 


|DEX0448_029.nt.l 


38384 


38384.0 


|Colon array 


2526-2585 


IDEX044B 029.nt.l 


38382 


38382.0 


jcolon array 


1872-1931 


(DEX0448_029,nt.l 


28734 


28734,0 Ijcolon array 


2147-2206 


|dEX0448 029. nt.2 


38383 


38383.0 


Colon array 


2101-2160 


(DEX04 48^029. nt.2 


28733 


28733.0 


Colon array 


1742-1801 


§EX0448_029.nt .2 


38381 


38381.0 


Colon array 


1447-1506 


|DEX0448_029.nt.2 


38384 


38384.0 


Colon array 


2081-2140 


|DEX0448_029.nt .2 


38382 


38382.0 llColon array 


1427-1486 


[DEX0448 029. nt.2 


28734 


28734,0 llColon array 


[1702-1761 


|DEX0448_029.nt.3 


28733 


28733.0 llColon array 


|1745-1804 


|DEX044 8_029.nt.3 


28734 


28734.0 


IColon array 


[1705-1764 


[DEX0448_029.nt.3 


38382 


38382.0 


Icolon array 


[1430-1489 


|DEX044 8_029.nt.3 


38384 


38384.0 . 


Icolpn array 


[20§4-2143 


|DEX0448_029.nt.3 


38381 


38381.0 


IColon array 


[1450-1509 



wo 2004/050860 



PCT/US2003/040063 



206 



DEX0448_029.nt.3 


38383 


(38383. 0 


iColon array 


2104-2163 


DEX0448_029.nt.4 


28734 


128734.0 


IColon array 


2219-2278 


DEX0448 029. nt. 4 


38382 


|38382.0 


IColon array 


1944-2003 


DEX0448_029.nt.4 


38384 


|38384.0 


jColon array 


2598-2657 


DEX0448._029.nt.4 


38381 


|38381.0 


|Colon array 


1964-2023 


DEX0448_029.nt.4 


28733 


[28733. 0 


Colon array 


2259-2318 


DEX0448 029. nt. 5 


38383 


38383.0 


Colon array 


1910-1969 


DEX0448 029. nt. 5 


38381 


38381.0 


Colon array 


1256-1315 


DEX0448 029. nt. 5 


38382 


38382.0 


Colon array 


1236-1295 


DEX0448 029. nt. 5 


38384 


38384.0 


Colon array 


1890-1949 


DEX044 8 029. nt. 5 


28734 


28734.0 


Colon array 


1511-1570 


DEX0448 029. nt. 6 


38383 


38383.0 


Colon array 


1944-2003 


DEX0448 029. nt. 6 


28733 


28733.0 


Colon array 


1585-1644 


DEX0448 029. nt. 6 


38381 


38381.0 


Colon array 


1388-1447 


DEX0448 029. nt. 6 


36384 


38384 .0 


Colon array 


1924-1983 


DEX0448 029. nt. 6 


38382 


36382.0 


Colon array 


1368-1427 


DEX044 8 029. nt, 6 


28734 


28734.0 


Colon array 


1545-1604 


DEX0448 029. nt. 7 


28733 


28733.0 


Colon array 


1332-1391 


DEX0448 029. nt. 7 


28734 


28734.0 


Colon array 


1292-1351 


DEX0448 029. nt. 7 


38382 


38382.0 


Colon array 


1017-1076 


DEX0448 029. nt. 7 


38384 


38384.0 


Colon array 


1671-1730 


DEX0448 029. nt. 7 


38381 


38381.0 


Colon array 


1037-1096 


DEX0448 029. nt. 7 


38383 


38383.0 


Colon array 


1691-1750 


DEX0448 O30.nt.l 


29604 


29604.0 


Colon array 


1800-1859 


DEX0448 030.nt.l 


40868 


40868.0 


Colon array 


1413-1472 


DEX0446 030.nt-l 


40867 


40867.0 


Colon array 


1536-1595 


DEX0448 030.nt.l 


29603 


29603.0 


Colon array 


1840-1899 


DEX0448 032.nt.l 


37239 


37239.0 


Colon array 


258-317 


DEX0448 032.nt.l 


36179 


36179.0 


Colon array 


531-590 


DEX0448 032.nt.l 


37227 


37227.0 


Colon array 


663-722 


DEX0448 032.nt.l 


37240 


37240.0 


Colon array 


238-297 


DEXD448 032.nt.l 


36184 


36184.0 


Colon array 


157-216 


DEX0448 032.nt.l 


36180 


36180.0 


Colon array 


490-549 


"■ — 7" , 
DEX044 8 032.nt.l 


36182 


36182.0 


Colon array 


663-722 


DEX0448 032.nt.l 


37228 


37228.0 


Colon array 


633-692 


DEX0448 033.nt.l 


22294 


22294.0 


Breast array | 


1462-1521 


DEX0448 034.nt.l 


22546 


22546.0 


Breast array ||236-295 


DEX0448 034.nt.l 


21619 


21619.01 


Ovarian array ||ll7-l76 


DEX0448 034. lit. 1 


5353 


5353.0 


filing array ||267-326 


UbAU44D 034.nt.l 


22545 


22545 .0 


Breast array ||2 66 -3 25 


DEX0448 034.nt.l 


5354 


5354.0 


Multi-Cancer array ||256-315 


DEX0448 034.nt.l 


21613 


21613.02 


Ovarian array ||400-459 


DEX0448 035-nt.l 


3729 


3729.0 


Lung array ||362e-3687 


DEX0448 035.nt.l | 


23863 


23863.0 


Colon array ||4553-4612 


DEX0448 OBS.nt.l | 


23862 


23862.0 


Colon array ||4617-4676 


DEX044 8- 035..nt.l | 


40309 . 


40309.0 


Breast array ||4 890-4949 


DEX0448 035. nt. 2 | 


23863 


23863.0 


Colon array ||4372-4431 



wo 2004/050860 



PCT/US2003/040063 



207 



jDEX044 8_035.nt.2 


|3729 


3729.0 


Lung array 


3447-3506 | 


|DEX044 8_035.nt .2 


|40309 


40309.0 


Breast array 


4709-'4768j 


|DEX0448_035.nt .2 


|23862 


23862.0 


Colon array 


4436-4495 j 


|DEX044 8_035.nt .3 


[3729 


3729.0 


Lung array 


3358-3417 ( 


|DEX0448_035.nt.3 


|23863 


23863.0 llcolon arrav 


4283-4342 J 


|DEX044 8_035.nt .3 


|40309 


40309.0 llfireast arrav 


4620-4679 ( 


|DEX0448_035.nt.3 


|23862 


23862.0 llcolon arrav 


4347-4406 [ 


[DEX0448_035.nt.4 


(40309 


4 0309.0 [(Breast array 


4106-4165 [ 


|DEX0448_035.nt.4 


|3729 


3729.0 _j(Lung array 


2844-2903^ 


|dEX044 8 035. nt. 4 


|23863 


23863.0 Ijcolon array 


3769-3828 | 


jDEX0448_036.nt .1 


(36812 


36812.0 J(colon array 


318-377 1 


|dEX0448 036.nt.l 


[29532 


29533.0 ](colon array 


660-719 1 


lDEX0448_036.nt .1 


|36811 


36811.0 JjColon array 


348-407 1 


|DEX0448_036.nt.l 


[29534 


29534.0 llColon array 


640-699 1 


|dEX0448 036.nt.l 


[29532 


29532.0 ([Colon array 


1444-1503 j 


|DEX044 8__036 .nt .1 


[2954 0 


29540.0 JColon array 


1265-1324 ( 


|DEX0448 036. nt. 2 


[29539 


29539.0 jjcolon array 


983-1042 ( 


lDEX044 8_036.nt .2 


[29532 


29532.0 |(colon array 


1250-1309 1 


|DEX044_8_03 6.nt.3 


[29539 


29539.0 |(Colon array 


289-348 1 


[dEX04 4 8_03 6 .nt .3 


129532 


29532.0 


jColon array 


556-615 1 


|DEX044 8J)36.nt.3 


(29540 


29540.0 


[colon array 


54-113 [ 


|DEX0448_037.nt .1 


[36341 


36341.0 


(Colon array 


645-704 ( 


|DEX0448_037.nt .1 


[36342 


36342.0 


(colon array 


565-624 t 


[DEX0448 037. nt. 2 


(36341 


36341.0 


[colon array 


738-797 J 


|DEX044 8_037.nt.2 


[36342 


36342.0 


(colon array 


658-717 1 


lDEX0448__037.nt .3 


[36341 


36341.0 


(Colon array 


1583-1642 1 


|DEX044 8 037. nt. 3 


[36342 


36342.0 


(colon array 


1503-1562 [ 


|DEX0448_037.nt.4 


(36341 


36341.0 


[Colon array |[774-833 | 


|DEX0448 037. nt. 4 


[36342 


36342.0 


(colon array ^[694 -753 ] 


(DEX0448 037. nt. 5 


(36341 


36341.0 J 


(Colon array 1(774-833 | 


^EX044 8_037.nt.6 


[36342 


36342.0 


(Colon array 


694-753 ( 


|DEX0448_038.nt.l 


[20896 1 


20896.0 


[Colon array 


40-99 1 


(DEX0448_038.nt,l 


[20895 


20895.0 


(Colon array 


80-139 [ 


|DEX0448 03d.nt.l 


(38855 


38855.0 


(Colon array 


913-972 ( 


|DEX044 8 04 0.nt.l 


[40756 


4 0756.0 |[Colon array 


[191-250 1 


|deX0448 040.nt .1 


[31347 


31347.0 


(Colon array 


237-296 I 


|DEX044 8_04 0.nt.l 


|40755 


4 0755.0 llColon array 


[237-296 J 


|DEX044 8_04 0.nt.2 


(4 0756 


40756.0 |(colon array 


487-546 1 


1deX0448 040.nt .2 


[7435 


7435.0 J|Lung array 


[259-318 1 


|dEX0448 040. nt. 2 


[31347 


3134 7.0 Jlcolon array 


[533-592 [ 


|DEX044 8_04 0.nt .2 


(7434 


7434.0 |(Lung array 


[299-358 ( 


IDEX0448 040. nt. 2 


[31346 


31346.0 llcolon array 


[665-724 1 


^EX0448_040.nt.2 


[40755 


40755.0 JjColon array 


[533-592 


|DEX044B_040.nt.3 


[7435 


7435.0 Ifcung array 


[268-327 


|DEX044 8 040.nt .3 


[40755 


40755.0 ijcoloh array 


542-601 


jDEX044 8_04 0.nt :3 


[7434 


74-34.0' '[Lung array 


308-367 


|pEX04 4 8_04 0.nt .3 


(31347 


31347.0 


[colon array 


542-601 
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DEX0448 040, nt. 3 


31346 


31346 .0 


Colon array ||674-733 


DEX0448 040. nt. 3 


" ■ " " 

40756 


40756.0 


Colon array (|496-555 


DEX0448 040. nt. 4 


7434 


7434.0 


Lung array 


249-308 


DEX0448 040. nt. 4 


31346 


31346.0 


Colon array 


615-674 


DEX0448 040. nt. 4 


31347 


31347.0 


Colon array 


483-542 


DEX0448 040. nt. 4 


7435 


7435.0 


Lung array 


209-268 


DEX0448 040.nt.4 


40755 


40755.0 


Colon array 


483-542 


DEX044 8 040. nt. 5 


7435 


7435.0 


Lung array 


155-214 


DEX044e 040. nt. 5 


31347 


31347.0 


Colon array 


429-488 


DEX0448 040. nt. 5 


40756 


40756.0 


Colon arrav 


383-442 


DEX0448 040. nt. 5 


7434 


7434.0 


Lung array 


195-254 


DEX0448 040.nt.5 


31346 


31346.0 


Colon array 


561-520 


DEX0448 040. nt. 5 


40755 


40755.0 


Colon array 


429-488 


DEX0448 040. nt. 6 


35943 


35943.0 


Colon array 


822-881 


DEX0448 040. nt. 6 


40756 


40756.0 


Colon array 


1684-1743 


DEX0448 040.nt.6 


31347 


31347.0 


Colon array 


1730-1789 


DEX044 8 04 0.nt.6 


35944 


35944.0 


Colon array 


779-838 


DEX0448 040, nt. 6 


7434 


7434.0 


Lung array 


1496-1555 


DEX0448_040.nt .6 | 


7435 


7435.0 


Lung array 


1456-1515 


DEX044 8_04 0.nt .6j 


31346 


31346.0 


Colon array 


1862-1921 


DEX0448_040.nt.6 


40755 


40755.0 


Colon array 


1730-1789 


DEX0448__040 . nt . 7 | 


31346 


31346.0 


Colon array 


310-369 


DEX0448 040. nt. 7 J 


40756 


40756.0 


Colon array 


132-191 


DEX0448..040.nt .7 | 


31347 


31347.0 


Colon array 


178-237 


DEX0448_041.nt.l | 


32724 


32724.03 


Prostate2 array 


1976-2035 


DEX044 8_041.nt.l | 


32720 


32720.01 


Prostate2 array 


2016-2075 


DEX0448 041.nt.l i 


28451 


28451.01 


Prostatel array 


2240-2299 


DEX044 8_041.nt .Ij 


32216 


32216.0 


Colon array 


1632-1691 


DEX0448 041.nt.l 


32718 


32718.01 


Prostate2 array 


2016-2075 


DEX0448 041.nt.l 


32716 


32716.03 


Prostate2 array 


168-227 


DEX0448 041. nt. 2 


32216 


32216.0 


Colon array 


1632-1691 


|DEX044 8 041.nt .2 


32716 


32716.03 


Prostate2 array 


168-227 


DEX0448 042.nt.l 


7412 


7412-0 


Lung array 


1245-1304 J 


DEX0448 042.nt.l 


29231 


29231.0 


Colon array 


1278-1337 


DEX0448 042.nt.l 


29272 


29272.0 


Colon array 


1587-1646 


DEX0448 042.nt.l 


7413 


7413.0 


Lung array 


1158-1217 


DEX0448 042.nt.l 


29232 


29232.0 


Colon array 


1229-1288 


DEX0448 042.nt.l 


29290 


29290.0 


[Colon array 


1349-1408 


DEX0448 042.nt.l 


29271 


29271.0 


jColon array 


1627-1686 


DEX0448 043.nt.l 


9043 


9043 . 0 


Icolon array 


181-240 


DEX0448 043.nt.l 


22566 


22586.0 


iBreast array 


117-176 


DEX0448 043. nt. 2 


966 


966.0 


iLung array 


716-775 


DEX0448 043. nt. 2 


9043 


9043.0 


Icolon array 


240-299 


DEX0448 043. nt. 2 


943 


943.0 


jLung array 


942-1001 


DEX0448 043. nt. 2 


960 


960.0 


[Lung . array 


436-495 


DEX0448 043. nt. 2 


22585 


22585.0 


iBreast array 


222-281 


DEX0448 043. nt. 2 


939 


939.0 


iLung array 


658-717 
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DEX0448 043. nt. 2 


944 


944.0 


Lung array 


922-981 


DEX0448 043. nt. 2 


22S86 


22586.0 


Breast array 


178-237 


DEX0448 043. nt. 2 


940 


940-0 


Lung array 


521-580 


DEX0448 043. nt. 2 


946 


946.0 


Lung array 


1076-1135 


DEX0448 044.nt.l 


36803 


36803.0 


Colon array 


433-492 


DEX0448 044.nt.l 


42013 


42013.0 


Multi-Cancer array 


628-687 


DEX0448 044. nt. 2 


36803 


36803.0 


Colon array 


827-886 



Example 2b: Relative Quantitation of Gene Expression 

Real-Time quantitative PGR with fluorescent Taqman® probes is a quantitation 
5 detection system utilizing the 5'- 3' nuclease activity of Taq DNA polymerase. The 
method uses an internal fluorescent oligonucleotide probe (Taqman®) labeled with a 5' 
reporter dye and a downstream, 3' quencher dye. During PGR, the 5 '-3' nuclease activity 
of Taq DNA polymerase releases the reporter, whose fluorescence can then be detected by 
the laser detector of the Model 7700 Sequence Detection System (PE Applied Biosystems, 

10 Foster Gity, CA, USA). Amplification of an endogenous control is used to standardize the 
amount of sample RNA added to the reaction and normalize for Reverse Transcriptase 
(RT) efficiency. Either cyclophilin, glyceraldehyde-3 -phosphate dehydrogenase 
(GAPDH), ATPase, or ISS ribosomal RNA (rRNA) is used as this endogenous control. 
To calculate relative quantitation between all the samples studied, the target RNA levels 

1 5 for one sample were used as the basis for comparative results (calibrator). Quantitation 
relative to the "calibrator" can be obtained using the comparative method (User Bulletin 
#2: ABI PRISM 7700 Sequence Detection System). 

The tissue distribution and the level of the target gene are evaluated for every 
sample in normal and cancer tissues. Total RNA is extracted from normal tissues, cancer 

20 tissues, and from cancers and the corresponding matched adjacent tissues. Subsequently^ 
first strand cDNA is prepared with reverse transcriptase and the polymerase chain reaction 
is done using primers and Taqman^ probes specific to each target gene. The results are 
analyzed using the ABI PRISM 7700 Sequence Detector. The absolute numbers are 
relative levels of expression of the target gene in a particular tissue compared to the 

25 calibrator tissue. 

One of ordinary skill can design appropriate primers. The relative levels of 
expression of the CSNA versus normal tissues and other cancer tissues can then be 
determined. All the values are compared to the calibrator. Noimal RNA samples are 
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commercially available pools, originated by pooling samples of a particular tissue from 
different individuals. 

The relative levels of expression of the CSNA in pairs of matched samples may 
also be determined. A matched pair is formed by mRNA from the cancer sample for a 
5 particular tissue and mRNA from the normal adjacent sample for that same tissue from the 
same individual. All the values are compared to the calibrator. 

In the analysis of matching samples, the CSNAs show a high degree of tissue 
specificity for the tissue of interest. These results confirm the tissue specificity results 
obtained with normal pooled samples. Further, the level of mRNA expression in cancer 
10 samples and the isogenic normal adjacent tissue from the same individual are compared. 
This comparison provides an indication of specificity for the cancer state (e.g. higher 
levels of mRNA expression in the cancer sample compared to the normal adjacent). 

Informaton on the samples tested in the QPCR experiments below include the 
Sample ID (Smpl ID), Organ, Tissue Type (Tiss Type), Diagnosis (DIAG), Disease 
1 5 Detail, and Stage or Grade (STG or GRD) in following table. 



Sample 
ID 


ORGAN 


TISS 
TYPE 


DIAG 


DISEASE DETAIL 


STG or GRD 


AS12 


Colon 


CAN 




T 


StageB 


AS12 


Colon 


NAT 




NL 




AS46 


Colon 


CAN 




malignant 


T3N1MX 


AS46 


Colon 


NAT 




NAT 




B34 


Colon 


CAN 


Adenocarcin 
oma 






B34 


Colon 


NAT 




NAT 




C9XR 


Colon 


CAN 




Rectum Cancer 


Stage D 


C9XR 


Colon 


NAT 




NAT 




CM67 


Colon 


CAN 


Adenocarcin 
oma 


Adenocarcinoma 
of cecum, 
Moderately 
differentiated 


Stage II 


CM67 


Colon 


NAT 




NAT 




TX89 


Colon 


CAN 


Adenocarcin 
oma 


Adenocarcinoma 
of Transverse 
Colon 


Stave IV 


TX89 


Colon 


NAT 




NAT 




AS43 


Colon 


CAN 


Adenocarcin 
oma 


malignant 




AS43 


Colon 


NAT 


Adenocarcin 
oma 


NAT 




AS98 


Colon 


-CAN 


Adenocarcin 
oma 


Moderately to 
poorly 

differentiated 
adenocarcinoma 


Duke's C 


AS98 


Colon 


NAT 




NAT 
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RS53 


Colon 


CAN 


Adenocarcin 

oma 


moderately 

differentiated 

adenocarcinoma 




RS53 


Colon 


NAT 


Adenocarcin 
oma 


NAT 




RCOl 


Colon 


CAN 


Cancer 




Stage IV 


RCOl 


Colon 


NAT 




NAT 




SG27 


Colon 


CAN 




malig 


Stage B 


SG27 


Colon 


NAT 




NAT 




DC19 


Colon 


CAN 




T 


Stage B 


DC19 


Colon 


NAT 




NL 




401C 


Colon 


CAN 


Adenocarcin 
oma 


Adenocarcinoma 
of ascending 
colon and cecum 


Stage III 


401C 


Colon 


NAT 




NAT 




CM12 


Colon 


CAN 




T 


Stage D 


CM12 


Colon 


NAT 


Adenocarcin 
oma 


Nat 




TXOl 


Colon 


CAN 


Adenocarcin 
oma 


Moderately 
differentiated 
adenocarcinoma 
of cecum 


Stage II; 
T3NOMO 


TXOl 


Colon 


NAT 




NAT 




030B 


Urinary 
Bladder 


CAN 


Carcinoma 


invasive 

Carcinoma , poorly 
differentiated 


Stage 

III, Grade 3 


030B 


Urinary 
Bladder 


NAT 




NAT 




TR17 


Urinary 
Bladder 


CAN 


Carcinoma 


transitional 
cell carcinoma 


Stagell/Grade 
III 


TR17 


Urinary 
Bladder 


NAT 




NAT 




520B 


Urinary 
Bladder 


CAN 


Sarcomatoid 
transit iona 
1 cell 
carcinoma 


Sarcomatoid 
transitional 
cell carcinoma 




520B 


Urinary 
Bladder 


NAT 




NAT 




KS52 


Cervix 


CAN 


Squamous 
cell 

carcinoma 


Keratinizing 
Squamous Cell 
Carcinoma 


IIIB, well 
diff. Gl; 
T3bNxM0 


KS52 


Cervix 


NAT 




NAT 




NK23 


Cervix 


CAN 




Nonkeratinizing 
Large Cell 


FIGO I I IB, 
undiff. G4; 
T3bNxM0 


NK23 


Cervix 


NAT 




NAT 




NKS54 


Cervix 


CAN 


Squamous 

cell 

carcinoma 


Nonkeratinizing 

Squamous Cell 
Carcinoma 


I IB, mod 

diff. G2; 
T2bNxM0 


NKS54 


Cervix 


NAT 




NAT 
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NKS55 


Cervix 


CAN 


Squamous 

cell 

carcinoma 


Nonkerat i n i = i ng 

Squamous Cell 
Carcinoma 


IIIB, Mod 
diff. G2; 
T3bNxM0 


NKS55 


Cervix 


NAT 




NAT 




NKS81 


Cervix 


CAN 


Squamous 
cell 

carcinoma 


large cell 
nonkerat ini zing 
sq care, IIB, 
moderately diff 


IIB 


NKS81 


Cejrvix 


NAT 




NAT 




10479 


Endometr 
ium 


CAN 




malignant mixed 
mullerian tumor 


T?, NX, Ml 


10479 


Endometr 
ium 


NAT 




NAT 




28XA 


Endometr 
ium 


CAN 


Endometrial 
adenocarcin 
oma 


malignant 


II/III 


28XA 


Endometr 
ium 


NAT 




NAT 


II/III 


ova 


Endometr 

num 




mod. diff, 
invasive , 
squamous 
differentia 
tion, FIGO- 

TT 






8XA 


Endometr 
ium 


NAT 




NAT 




106XD 


Kidney 


CAN 


Renal cell 
carcinoma 


renal cell 
carcinoma, clear 
cell, localised 


3 


106XD 


Kidney 


NAT 




NL 




107XD 


Kidney 


CAN 


Renal cell 

carcinoma 


renal cell 
carcinoma, clear 
cell, with 
metastatic 


G III 


107XD 


Kidney 


NAT 




NL 




109XD 


Kidney 


CAN 




Malignant 


G III 


109XD 


Kidney 


NAT 




NL 




lOXD 


Kidney 


CAN 


Renal cell 
carcinoma 


renal cell 
carcinoma, clear 
cell, localized, 
grade 2-3 


3 


lOXD 


Kidney 


NAT 




NL 




22K 


Kidney 


CAN 


Renal cell 
carcinoma 


Renal cell 
carcinoma 


G2 , Mod . 
Diff. 


22K 


Kidney 


NAT 




NAT 




15XA 


Liver 


CAN 




Sarcoma , Retroper 
itoneal Tumor 


Grade -2 


15XA 


Liver 


NAT 




CA 


St. I, G4 


174L 


Liver 


CAN 


Hepatocellu 
lar 

carcinoma 


Moderate to well 
differentiated 
hepatocellular 
carcinoma 




174L 


Liver 


NAT 


Hepatocellu 
lar 

carcinoma 


NAT 





wo 2004/050860 



PCT/US2003/040063 



213 



187L 


Liver 


CAN 


Adenocarcin 
oma 


Metastatic 
Adenocarci noma 


Liver 

(Gallbladder) 


187L 


Liver 


NAT 




NAT 




205L 


Lung 


CAN 


Adenocarcin 

oma 


poorly 

differentiated 
adenocarcinoma 


T2, Nl, MX 


205L 


Lung 


NAT 




NAT 




315L 


Lung 


CAN 


Squamous 
cell 

carcinoma 






315L 


Lung 


NAT 


Adenocarcin 
oma 


NAT 




507L 


Lung 


CAN 


Bronchi oloa 

Iveolar 

carcinoma 


bronchioalveolar 
carcinoma 


Stage IB, Gl, 
well diff. 


507L 


Lung 


NAT 




NAT 




528Ii 


Lung 


CAN 


Adenocarcin 
oma 


Adenocar c inoma 


St.IV,T2N0Ml, 
infiltrating 
poorly diff. 


528L 


Lung 


NAT 




NAT 




883 7L 


Lung 


CAN 


Squamous 
cell 

carcinoma 


Squamous cell 

cei r c i nt/itio 


T2 MO MO 


8837L 


Lung 


NAT 




NAT 




ACll 


Lung 


CAN 


Adenocarcin 
oma 


poorly 

differentiated 
adenocarcinoma 


T2 , N2 , Ml 


ACll 


Lung 


NAT 




NAT 




AC39 


Lung 


CAN 


Adenocarcin 
oma 


intermediate 
grade 

adnocar cinema 


T2, N2, MX 


AC39 


Lung 


NAT 




NAT 




SQ80 


Lung 


CAN 


Squamous 
cell 

carcinoma 


poorly 

differentiated 
squamous cell 
carcinoma 


Tl, Nl, MO 


SQ80 


Lung 


NAT 




NAT 




SQ81 


Lung 


CAN 


Squamous 
cell 

carcinoma 


poorly 

differentiated 

squamous 

carcinoma 


T3, Nl, MX 


SQ81 


Lung 


NAT 




NAT 




19DN 


Mammary 


CAN 


Invasive 

ductal 
carcinoma 


Invasive ductal 
carcinoma 


G3 , Stage 
IIA; T2N0M0 


19DN 


Mammary 


NAT 




NAT 




42DN 


Mammary 


CAN 


Invasive 

ductal 

carcinoma 


Invasive Ductal 
Carcinoma 


T3aNlM0 II lA, 
G3 


42DN 


Mammary 


NAT 




NAT 




517 


•Mammary 


CAN 


Infiltratin 
g ductal 
carcinoma 


Infiltrating 
ductal carcinoma 


St. IIA, 03 


517 


Mammary 


NAT 




NAT 
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78 IM 


Mpmm^i T"\/ 
I icii kluioi X. y 


CAN 


Invasive 
ductal 

^cix wXimiiici 




Architectural 

grade- 

3/3, Nuclear 

err* A Ho • ^ / 


78 IM 


Mammary 


NAT 




NAT 




869M 


Mammary 


CAN 


Invasive 
carcinoma 


Invasive 
Carcinoma 


Stage IIA 
Gl ;T2NoMO 


869M 


Mammary 


NAT 




NAT 




976M 


Mammary 


CAN 


Invasive 

ductal 

carcinoma 


Invasive Ductal 
Carcinoma 


T2N1M0 (Stage 
2B Grade 2-3) 


976M 


Mammary 


NAT 




NAT 






I Id II 111 tax y 


CAN 


VvCix wxiiuiiia 


f aT"^S nnnia 


Stage 

IIA-TlNlMo 


S570 


i miliums jr 


NAT 




NAT 




S699 


Mammary 


CAN 


Invasive 

lobular 

carcinoma 


Invasive Lobular 
Carcinoma 


Stage IIB 
Gl ;T2NlMo 


S699 


Mammary 


NAT 




NAT 




S997 


Mammary 


CAN 


Invasive 

ductal 

carcinoma 


Invasive Ductal 
Carcinoma 


Stage IIB G3 ; 
T2N1MO 


S997 


Mammary 


NAT 




NAT 




G021 


Ovary 


CAN 


Carcinoma 


St. IIIC, poorly 
diff . 


Stage- IIIC, 
poorly dif f . 


G021 


Ovary 


NAT 




NAT 




1005O 


Ovary 


CAN 




papillary serous 

and endometrioid 

ovarian 

carcinoma, 

concurrent 

metastatic 

breast cancer 


3 


104 0O 


Ovary 


CAN 




papillary serous 

adeno, 

metastatic 




105O 


Ovary 


CAN 




Papillary Serous 
Carcinoma with 
Focal Mucinous 
Di f f erentia t ion 


ocage xc ou; 
TlcNOMO 


13 OX 


Ovarv 


CAN 




Ovarian cancer 




7180 


Ovary 


CAN 


Mucxiwcax Win 

oma 


ma 1 i gnan t tumor 


IIIC 


MB 


Ovary 


CAN 


oma 


CA 




1230 


Ovary 


NRM 




Normal 




18GA 


Ovary 


NRM 




NL 




2061 


Ovary 


NRM 




NL 




3370 


Ovary 


NRM 




Normal 




4 0G 


Ovary 


NRM 




NL 




5150 


Ovary 


NRM 




Normal 




C004 


Ovary 


NRM 




NL 




C177 


Ovary 


NRM • 




several fluid 
filled cysts 
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71XL 


Pancreas 


CAN 




villous adenoma 
with paneth cell 
metaplasia 


localized 


71XL 


Pancreas 


NAT 




NL 




82XP 


Pancreas 


CAN 




serious 
cystadenoma 




82XP 


Pancreas 


NAT 




NL 




92X 


Pancreas 


CAN 


Ductal 

adenocarcin 

oma 


ductal 

adenocarcinoma 


mod to 
focally 
pooriy oiiL. 


92X 


Pancreas 


NAT 




NL 




23B 


Prostate 


CAN 




Prostate tumor 


Gleason's 3+4 


23B 


Prostate 


NAT 




NAT 




65XB 


Prostate 


CAN 


Adenocarcin 
oma 


adenocarcinom 


3+4=7 


65XB 


Prostate 


NAT 




NL 




675P 


Prostate 


UAN 


Adenocarcin 
oma 


acienocorcinoma 




f en 
Of or 


Prostate 


IMAl 




IMOXlUClX 




D ^ YD 

OH Ad 


D >* o ^ 3 ^ ^ 




Adenocarcin 
oma 


clUtrilU(_.cii. (oX 11 Will 


2 + 3 


84XB 


Prostate 


NAT 




NL 




Q e O D 


Froscane 




Adenocarcin 
otna 




T2C NO MX 


958? 


Prostate 


NAT 


NAT 


Normal 




263C 


Prostate 


BPH 




BPH 




276P 


Prostate 


BPH 




BPH 




767B 


Prostate 


BPH 




prostate BPH 




855P 


Prostate 


BPH 




BPH 




lOR 


Prostate 


PROST 




active chronic 
prostatitis 


TO. NO, MO 


2 OR 


Prostate 


PROST 




PROSTATITIS 




287S 


Skin 


CAN 


Squamous 
cell 

carcinoma 


Invasive 
Keratinizing 
Squamous Cell 
Carcinoma 


Moderately 
Differentiate 

d 


287S 


Skin 


NAT 




NAT 




3 9A 


Skin 


CAN 




CA 


St. II 


39A 


Skin 


NAT 




CA 


St. II 


669S 


Skin 


CAN 


Melanoma 


Nodular 

malignant 
melanoma 




669S 


Skin 


NAT 




NAT 




171S 


Small 

Intestin 

e 


CAN 


Adenocarcin 
oma 


Moderately 
differentiated 
Adenocarcinoma, 
invasive 




171S 


Small 

Intestin 

e 


NAT 




NAT 




20SM 


Small 

Intestin 

e 


CAN 


Adenocarcin 
oma 


Adenocarcinoma , 
me ta Stic to lung 
& liver 


St. IV, 
poorly diff . 
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20SM 


Small 

Intestin 

e 


NAT 




NAT 




H89 


Smal 1 

Intestin 

e 


CAN 


Adenocarcin 
oma 


Adenocarcimoa 


80% tumor, 
50% necrosis, 
.moderately 

d, G2-3; 
T3N1MX 


H89 


Small 

Intestin 

e 


NAT 


Adenocarcin 
oma 


NAT 




261S 


Stomach 


CAN 


Signet-ring 
cell 

carcinoma 


Signet-ring cell 

carcinoma 


Stage IIIA, 

T3N1M0 




Stomach 


NA X 




Vifii 




288S 


Stomach 


CAN 


Adenocarcin 
oma 


Infiltrating 
Adneocarcinoma 


Moderately 
Di f f erentiate 
d 


288S 


Stomach 


NAT 




NAT 




AC93 or 
509L 


Stomach 


CAN 


Adenocarcin 
oma 


Adenocarcinoma 


St. IV, 04, 

T4N3M0, 
poorly diff. 


AC93 or 
509L 


Stomach 


NAT 




NAT 




88S 


Stomach 


CAN 


Adenocarcin 
oma 


Mucinous 
adenocarcinoma 


T3N1M0, St. 
IIIA 


88S 


Stomach 


NAT 




NAT 




143N 


Thyroid 
Gland 


CAN 


Follicular 
carcinoma 


Follicular 
Carcinoma 




143N 


Thyroid 
Gland 


NAT 




NAT 




270T 


Thyroid 
Gland 


CAN 




CA 




270T 


Thyroid 
Gland 


NAT 




NAT 




56T 


Thyroid 
Gland 


CAN 


Papillary 
carcinoma 


Papillary 
Carcinoma 


St. Ill; 
T4N1M0 


56T 


Thyroid 
Gland 


NAT 




NAT 




3 9X 


Testes 


CAN 




CA 




39X 


Testes 


NAT 




NAT 




647T 


Testes 


CAN 


Teratocarci 

noma 


Teratocarcinoma 


Stage lA 


64 7T 


Testes 


NAT 


Teratocarci 
noma 


NAT 




663T 


Testes 


CAN 


Teratocarci 
noma 


Teratocarcinoma 




663T 


Testes 


NAT 




NAT 




135X0 


Uterus 


CAN 




Uterus normal 




135X0 


Uterus 


NAT 




Uterus tumor 




85XU 1 


Uterus 


CAN 




endometrial 
carcinoma 


I 


85XU 1 Uterus 


NAT 




NL 
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Bl 


Blood 1 NRM 




Normal 




B3 


Blood 


NRM 




Normal 




B5 


olOOu 


HPM 




Normal 




B6 


BlOOu 






Normal 




Bl 1 


Blood 


NRM 




Normal 




982B 


Blood 


NRM 




Normal 




4 SAD 


Adrenal 
Gland 


NRM 




Nozrmal 




lOBR 


Brain 


NRM 




Normal 




OICL 


Colon 


NRM 




Normal 




06 CV 


Cervix 


NRM 




Normal 




OlES 


Esophagu 

' s 


NRM 




Normal 




4onR 


Heart 


NRM 








OOHR 


Human 
Ref erenc 
e 


CAN 


CAN 


Cancer pool 




55KD 


Kidney 


NRM 




Normal 




89LV 


Liver 


NRM 




Normal 




90LN 


Lung 


NRM 




Normal 




OlMA 


Mammary 


NRM 




Normal 




84MU 


Skeletal 
Muscle 


NRM 




Normal 




J Ar V 


Ovary 


NRM 




Normal 






Pancreas 


NRM 




Normal 




59PL 


Placenta 


NRM 




Normal 




n ODD 


Prostate 


NRM 




Normal 








NRM 




Normal 




59SM 


Small 

Intestin 

e 


NRM 




Normal 




7GSP 


Spleen 


NRM 




Normal 




09ST 


Stomach 


NRM 




Normal 




4 GTS 


Testes 


NRM 




Normal 




99TM 


Thymus 
Gland 


NRM 




Normal 




16TR 


Trachea 


NRM 




Normal . 




57UT 


Uterus 


NRM 




Normal 





DEX>Q448 026.nt.l fCln2S91 

The relative expression level of Cln259 in various tissue samples is included below. 
Tissue samples include 78 pairs of matching samples, 7 non matched cancer samples, and 
5 35 normal samples, all from various tissues annotated in the table. A matching pair is 
formed by mRNA from the cancer sample for a particular tissue and mRNA from the 
nomial adjacent sample for that same tissue from the same individual. Of the normal 
samples 5 were blood samples which measured the expression levels in blood cells. 
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Additionally, 2 prostatitis, and 4 Benign Prostatic Hyperplasia (BPH) samples are 
included. All the values are compared to normal colon sample CLNOICL (calibrator). 

The table below contains the relative expression level values for the sample as 
compared to the calibrator. The table includes the Sample ID and expression level values 
5 for the following samples: Cancer (CAN), Normal Adjacent Tissue (NAT), Normal Tissue 
(NRM), Benign Prostatic Hyperplasia (BPH), and Prostatitis (PROST). 



Sample IDi 


CAN 


NAT 


NRM 


BPH 


PROST 


CLNAS12 1 


0.74 


1.22 








CLNAS4 6 


1.13 


1.92 








CLNB34 i 


0.29 


0.38 








CLNC9XR 


0.52 1 


0.94 








CLNCM67 1 


0.98 


0. 97 








CLNTX89 


0.93 


0.37 








CLNAS4 3 


7.88 


1.10 








CLNAS98 1 


3.38 


1.22 








CLNRS53 


0.33 


0. 82 








CLNRCOl 1 


1.25 1 


1.47 








CLNSG2 7 


1 . 12 _j 


2 . 04 








CLNDC19 ! 


2.80 


1 . 19 


1 






CLN4 01C 


1.18 


1.64 








CLNCM12 i 


0.50 


1 . 14 








CLNTXOl 1 


1.31 


3.49 








BLD030B 


0.46 


0.70 








BLD520B 1 


2.07 


0.26 








BLDTR17 ! 


1.16 


1.25 








CVXKS52 ] 


4 .74 


1.34 








CVXNK23 


4 .04 


16.66 








CVXNKS54 


2.96 J 


0.81 








CVXNKS55 


2 .63 1 


5.01 








CVXNKS81 


6.61 


12 .74 








ENDO10479 


6.39 


1.17 








END028XA 


2.85 


6.33 








END08XA 


1.44 


0.47 








KID106XD 


0 . 87 


0.46 








KID107XD 


1.42 


0.91 








KID109XD 


1.15 J 


0.85 








KIDIOXD 


0.45 1 


0.12 








KID22K 


0. 80 I 


0.28 








LNG205L 


2 .21 1 


1.94 








LNG315L 


1.87 1 


1.88 








LNG507L 


2.04 


2.43 








LNG528L 


4 .22 


1.49 








LNG8837L 


1.69 1 


2.24 








LNGACll 


2.36 J 


2.71 
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LNGAC39 


4.81 1 


1.51 








LNGS080 


1-23 1 


1.59 








LNGSQ81 


4.06 J 


2.94 








LVR15XA 


1.10 1 


1.05 








LVR174L 


1.38 J 


2.97 








LVR187L 


3 .46 1 


2.44 








MAM19DN 


2.90 i 


2.01 








MAM4 2DN 


4.40 1 


2.31 








MAM517 


13 .65 J 


2.05 








MAM781M 


1.62 1 


0.39 








MAM869M 


30.36 j 


0.86 








MAM976M 


3 .23 1 


1.06 








MAMS570 


1.53 1 


3 .30 








MAMS699 


0.50 1 


1.44 








MAMS997 


3,39_J 


1.32 








OVRG021 


1.48 1 


0.47 








OVR1005O 


5.96 










OVR104 0O 


11.10 










OVR105O 


1.58 










OVR130X 


17.74 










OVR7180 


3.55 










OVRAiB 


4 .85 1 










OVR1230 






0,52 






0VR18GA 






0.30 






OVR206I 






0.26 






0VR3 3 7O 






0.00 






0VR4 OG 






0.39 






OVR5150 






0.32 






OVRC004 






3 .88 






OVRC177 






0.48 






PAN71XL 


1.22 i 


0.46 








PAN82XP 


1.00 j 


1-32 








PAN92X i 


9.27 


0.85 








PR02 3B 


1.62 


2.06 








PR065XB 


0.65 


1.30 








PR0675P 


2.12 


1.79 








PR084XB 


1.38 


1.64 








PR0958P 


1 .80 


1.03 








PR0263C 








1.38 




PR0276P 








1.12 




PR0767B 








1.88 




PR0855P 








1.40 




PROIOR 










0.87 


PRO20R 










0.75 


SKN287S 


1 .13 


1.47 








SKN39A 


0.22 


0.62 








SKN669S 


0.41 


1.28 
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|SMINT171S 


2.91 1 


1.41 








|SMINT20SM 


4-90 1 


2.06 




— 




SMINTH89 


0.99 1 


1.26 








ST0261S 


4.23 J 


1.20 








ST02 88S 


1 .07_| 


1.22 








ST088S 


2 .78 1 


1.17 








THRD143N 


0.93 ! 


1.37 








THRD270T 


2.11 t 


2.42 








THRD56T 


4 .34 1 


1.52 








TST39X 


2.50 


2.82 








TST647T 


4 .81 _| 


2.35 








TST663T 


^ . 19 1 


3 . 04 








UTR135XO 


.... 

[1.14 j 


1.18 








UTR85XU 


|3.40 ! 


1.57 








BLOBl 


1 




0.00 






BL0B3 


1 




0.63 






BLOB 6 


1 i 




0.55 






BLOB 11 


1 i 




0.58 






BL0982B 


1 1 




0.00 






ADR4 8AD 


1 ! 




0.90 






HUMREFOOH 
R 


jl . 14 










BRNIOBR 


1 




0.10 






CLNOICL 


1 i 




1 .00 






ESOOIES 


1 i 




0.73 






HRT4 6HR 


1 




0.02 






KID55KD 


1 




0.12 






LVR89LV 


1 i 




3 . 12 






LNG90LN 


1 ! 




1.69 






MAMOIMA 


1 i 




0.53 






MSL84MU II I 




0. 02 






0VR3APV 


1 i 




0.48 






PAN04PA 


1 ! 




0.75 






PLA59PL 


I 




1.66 






PRO09PR 


1 




0.87 






REC21RC 


1 




4 . o J 






SMINT59SM 


1 




0.30 






SPL7GSP 


1 




0.90 






ST009ST 


1 




1.59 






THYM99TM 


1 




0,20 






TRA16TR 


1 




1.12 






TST4GTS 


1 ll 


2.95 






UTR57UT 


1 11 


0.77 







0.00= Negative or Not Detected 



The sensitivity for Cln259 expression was calculated for the cancer samples versus 
normal samples. The sensitivity value indicates the percentage of cancer samples that 
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show levels of Cln259 at least 2 fold higher than the normal tissue or the corresponding 
normal adjacent form the same patient. 

This specificity is an indication of the level of colon tissue specific expression of 
the transcript compared to all the other tissue types tested in our assay. Thus, these 
5 experiments indicate Cln259 being useful as an colon cancer diagnostic marker and/or 
therapeutic target. 

Sensitivity and specificity data is reported in the table below. 





CLN 


1 LNG ] 


MAM 


OVR 


PRO 


Sensitivity, Up vs. NAT 


27% 


|22% 1 


56% 


0% 


0% 


Sensitivity, Down vs. 
NAT 


20% 


|o% 


22% 


0% 


20% 


Sensitivity, Up vs. NRM 


20% 


|33% 


89% 


100% 


40% 


Sensitivity, Down vs. 
NRM 


20% 


|0% 


0% 


0% 


0% 


Specificity 


3.47 
% 


Is. 41 
1% 


9.19 
% 


12.3 
% 


4 .28 
% 



Altogether, the tissue specificity, plus the mRNA differential expression in the 
1 0 samples tested are believed to make Cln259 a good marker for diagnosing, monitoring, 

staging, imaging and treating colon cancer. 

Additionally, the the tissue specificity, plus the mRNA differential expression in 

the samples tested are believed to make Cln259 a good marker for diagnosing, monitoring, 

staging, imaging and treating ovarian cancer and/or breast cancer, 
1 5 Primers used for QPCR Expression Analysis of Cln259 are as follows: 

(Cln259_forward): TACGCAGAGCTCATCGTCCTT (SEQ ID NO:238) 
(Cln259_reverse): ACAACCACGAAGAGCCAGTCTT (SEQ ID NO:239) 
(an259_probe): TGGCTGAGCTCTTACCTGGTTTTCAGGC (SEQ ID NO:240) 

Conclusions 

20 Altogether, the high level of tissue specificity, plus the mRNA overexpression in 

matched samples tested are indicative of SEQ ID NO: 1-95 being a diagnostic marker 
and/or a therapeutic target for cancer. 

Example 3: Protein Expression 

The CSNA is amplified by polymerase chain reaction (PCR) and the amplified 
25 DNA fragment encoding the CSNA is subcloned in pET-21d for expression in E. coli. In 
addition to the CSNA coding sequence, codons for two amino acids, Met-Ala, flanking the 
NH2-terminus of the coding sequence of CSNA, and six histidines, flanking the 
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COOH-lerminus of the coding sequence of CSNA, are incorporated to serve as initiating 
Met/restriction site and purification tag, respectively. 

An over-expressed protein band of the appropriate molecular weight may be 
observed on a Coomassie blue stained polyacrylamide gel. This protein band is confirmed 
5 by Westem blot analysis using monoclonal antibody against 6X Histidine tag. 

Large-scale purification of CSP is achieved using cell paste generated from 6-Iiter 
bacterial cultures, and purified using immobilized metal affinity chromatography (IMAC). 
Soluble fractions that are separated from total cell lysate were incubated with a nickel 
chelating resin. The column is packed and washed with five column volumes of wash 
10 buffer. CSP is eluted stepwise with various concentration imidazole buffers. 

Example 4: Fusion Proteins 

The human Fc portion of the IgG molecule can be PCR amplified, using primers 
that span the 5'and 3' ends of the sequence described below. These primers also should 
have convenient restriction enzyme sites that will facilitate cloning into an expression 

15 vector, preferably a mammalian expression vector. For example, if pC4 (Accession No, 
209646) is used, the human Fc portion can be ligated into the BamHI cloning site. Note 
that the 3' BamHI site should be destroyed. Next, the vector containing the human Fc 
portion is re-restricted with BamHI, linearizing the vector, and a polynucleotide of the 
present invention, isolated by the PCR protocol described in Example 2, is ligated into this 

20 BamHI site. Note that the polynucleotide is cloned without a stop codon, otherwise a 
fusion protein will not be produced. If the naturally occurring signal sequence is used to 
produce the secreted protein, pC4 does not need a second signal peptide. Alternatively, if 
the naturally occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. 5ee, e.g., WO 96/34891. 

25 Example 5: ProductioD of an Antibody from a Polypeptide 

In general, such procedures involve immunizing an animal (preferably a mouse) 
with polypeptide or, more preferably, with a secreted polypeptide-expressing cell Such 
cells may be cultured in any suitable tissue culture medium; however, it is preferable to 
cuhure cells in EaHe's modified Eagle's medium supplemented with 10% fetal bovine 
30 serum (inactivated at about Se^'C), and supplemented with about 10 g/1 of nonessential 
amino acids, about 1,000 U/ml of penicillin, and about 100, pg/ml of streptomycin. The 
splenocyles of such mice are extracted and fused with a suitable myeloma cell line. Any 
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suitable myeloma cell line may be employed in accordance with the present invention; 
however, it is preferable to employ the parent myeloma cell line (SP20), available from 
the ATCC. After fusion, the resulting hybridoma cells are selectively maintained in HAT 
medium, and then cloned by limiting dilution as described by Wands et ai, 
5 Gastroenterology 80: 225-232 (1981). 

The hybridoma cells obtained through such a selection are then assayed to identify 
clones which secrete antibodies capable of binding the polypeptide. Alternatively, 
additional antibodies capable of binding to the polypeptide can be produced in a two-step 
procedure using anti-idiotypic antibodies. Such a method makes use of the fact that 

10 antibodies are themselves antigens, and therefore, it is possible to obtain an antibody 
which binds to a second antibody. In accordance with this method, protein specific 
antibodies are used to immunize an animal, preferably a mouse. The splenocytes of such 
an animal are then used to produce hybridoma cells, and the hybridoma cells are screened 
to identify clones which produce an antibody whose ability to bind to the protein-specific 

1 5 antibody can be blocked by the polypeptide. Such antibodies comprise anti-idiotypic 
antibodies to the protein specific antibody and can be used to immunize an animal to 
induce formation of further protein-specific antibodies. 

Example 6: Method of Determining Alterations in a Gene Corresponding to a 
Polynucleotide 

20 RNA is isolated from individual patients or from a family of individuals that have 

a phenotype of interest. cDNA is then generated from these RNA samples using protocols 
known in the art. See, Sambrook (2001), supra. The cDNA is then used as a template for 
PCR, employing primers surrounding regions of interest in SEQ ID NO: 1-95. Suggested 
PCR conditions consist of 35 cycles at 95'*C for 30 seconds; 60-120 seconds at 52-58°C; 

25 and 60-1 20 seconds at 70*^0, using buffer solutions described in Sidransky et al, Science 
252(5006): 706-9 (1991). See also Sidransky et al. Science 278(5340): 1054-9 (1997). 

PCR products are then sequenced using primers labeled at their 5* end with T4 
polynucleotide kinase, employing SequiTherm Polymerase. (Epicentre Technologies). The 
intron-exon borders of selected exons are also determined and genomic PCR products 

30 analyzed to confirm the results. PCR products harboring suspected mutations are then 
cloned and sequenced to validate the results of the direct sequencing. PCR products is 
cloned into T-tailed vectors as described in Holton et aL, Nucleic Acids Res., 19: 1 156 
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(1991) and sequenced with T7 polymerase (United States Biochemical). Affected 
individuals are identified by mutations not present in unaffected individuals. 

Genomic rearrangements may also be determined. Genomic clones are 
nick-translated with digoxigenin deoxyuridine 5' triphosphate (Boehringer Manheim), and 
5 FISH is performed as described in Johnson et ah. Methods Cell Biol 35: 73-99 (1991). 
Hybridization with the labeled probe is carried out using a vast excess of human cot-1 
DNA for specific hybridization to the corresponding genomic locus. 

Chromosomes are counterstained with 4,6-diamino-2-phenyIidole and propidium 
iodide, producing a combination of C-and R-bands. Aligned images for precise mapping 

10 are obtained using a triple-band filter set (Chroma Technology, Brattleboro, VT) in 

combination with a cooled charge-coupled device camera (Photometries, Tucson, AZ) and 
variable excitation wavelength filters. Johnson (1991). Image collection, analysis and 
chromosomal fractional length measurements are performed using the ISee Graphical 
Program System. (Inovision Corporation, Durham, NC.) Chromosome alterations of the 

1 5 genomic region hybridized by the probe are identified as insertions, deletions, and 

translocations. These alterations are used as a diagnostic marker for an associated disease. 

Example 7: Method of Detecting Abnormal Levels of a Polypeptide in a Biological 
Sample 

Antibody-sandwich ELISAs are used to detect polypeptides in a sample, preferably 
20 a biological sample. Wells of a microtiter plate are coated with specific antibodies, at a 
final concentration of 0.2 to 10 ug/ml. The antibodies are either monoclonal or polyclonal 
and are produced by the method described above. The wells are blocked so that 
non-specific binding of the polypeptide to the well is reduced. The coated wells are then 
incubated for > 2 hours at RT with a sample containing the polypeptide. Preferably, serial 
25 dilutions of the sample should be used to validate results. The plates are then washed 

three times with deionized or distilled water to remove unbound polypeptide. Next, 50 pi 
of specific antibody-alkaline phosphatase conjugate, at a concentration of 25-400 ng, is 
added and incubated for 2 hours at room temperature. The plates are again washed three 
times with deionized or distilled water to remove unbound conjugate. 75 pi of 
30 4-methylumbelliferyl phosphate (MUP) or p-nitrophenyl phosphate (NPP) substrate 
solution are added to each well and incubated 1 hour at room temperature. 
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The reaction is measured by a microtiter plate reader. A standard curve is 
prepared, using serial dilutions of a control sample, and polypeptide concentrations are 
plotted on the X-axis (log scale) and fluorescence or absorbance on the Y-axis (linear 
scale). The concentration of the polypeptide in the sample is calculated using the standard 
5 curve. 

Example 8: Formulating a Polypeptide 

The secreted polypeptide composition will be formulated and dosed in a fashion 
consistent with good medical practice, taking into account the clinical condition of the 
individual patient (especially the side effects of treatment with the secreted polypeptide 

10 alone), the site of delivery, the method of administration, the scheduling of administration, 
and other factors known to practitioners. The "effective amount" for purposes herein is 
thus determined by such considerations. 

As a general proposition, the total pharmaceutically effective amount of secreted 
polypeptide administered parenterally per dose will be in the range of about 1, jig/kg/day 

15 to 10 mg/kg/day of patient body weight, although, as noted above, this will be subject to 
therapeutic discretion. More preferably, this dose is at least 0.0 1 mg/kg/day, and most 
preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If given 
continuously, the secreted polypeptide is typically administered at a dose rate of about 1 
jag/kg/hour to about 50 mg/kg/hour, either by 1-4 injections per day or by continuous 

20 subcutaneous infiisions, for example, using a mini-pump. An intravenous bag solution 
may also be employed. The length of treatment needed to observe changes and the 
inteA'al following treatment for responses to occur appears to vary depending on the 
desired effect. 

Pharmaceutical compositions containing the secreted protein of the invention are 
25 administered orally, rectally, parenterally, intracistemally, intravaginally, intraperitoneally, 
topically (as by powders, ointments, gels, drops or transdermal patch), bucally, or as an 
oral or nasal spray. "Pharmaceutically acceptable carrier" refers to a non-toxic solid, 
semisolid or liquid filler, diluent, encapsulating material or formulation auxiliary of any 
type. The term "parenteral" as used herein refers to modes of administration which 
30 include intravenous, intramuscular, intraperitoneal, intrastemal, subcutaneous and 
intraarticular injection and infusion. 
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The secreted polypeptide is also suitably administered by sustained-release 
systems. Suitable examples of sustained-release compositions include semipermeable 
polymer matrices in the form of shaped articles, e.g., films, or microcapsules. Sustained- 
release matrices include polylactides (U. S. Pat. No.3, 773,919, EP 58,481, the contents of 
5 which are hereby incorporated by reference herein in their entirety), copolymers of L- 
glutamic acid and gamma-ethyl-L-glutamate (Sidman, U. et al., Biopolymers 22: 547-556 
(1983)), poly (2-hydroxyethyl methacrylate) (R. Langer et al., J. Biomed. Mater. Res. 15: 
167-277 (1981), and R. Langer, Chem. Tech. 12: 98-105 (1982)), ethylene vinyl acetate 
(R. Langer et al.) or poly-D- (-)-3-hydroxybutyric acid (EP 133,988). Sustained-release 

10 compositions also include liposomally entrapped polypeptides. Liposomes containing the 
secreted polypeptide are prepared by methods known per se: DE Epstein et al., Proc. Natl. 
Acad. Sci. USA 82: 3688-3692 (1985); Hwang et al.. Proc. Natl. Acad. Sci. USA 77: 
4030-4034 (1980); EP 52,322; EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese 
Pat. Appl. 83-1 18008; U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102.324, the 

1 5 contents of which are hereby incorporated by reference herein in their entirety. Ordinarily, 
the liposomes are of the small (about 200-800 Angstroms) unilamellar type in which the 
lipid content is greater than about 30 mol. percent cholesterol, the selected proportion 
being adjusted for the optimal secreted polypeptide therapy. 

For parenteral administration, in one embodiment, the secreted polypeptide is 

20 formulated generally by mixing it at the desired degree of purity, in a unit dosage 

injectable form (solution, suspension, or emulsion), with a phamiaceutically acceptable 
carrier, i.e., one that is non-toxic to recipients at the dosages and concentrations employed 
and is compatible with other ingredients of the formulation. 

For example, the formulation preferably does not include oxidizing agents and 

25 other compounds that are known to be deleterious to polypeptides. Generally, the 

formulations are prepared by contacting the polypeptide uniformly and intimately with 
liquid carriers or finely divided solid carriers or both. Then, if necessary, the product is 
shaped into the desired formulation. Preferably, the carrier is a parenteral carrier, more 
preferably, a solution that is isotonic with the blood of the recipient. Examples of such 

30 carrier vehicles include water, saline, Ringer*s solution, and dextrose solution. Non- 
aqueous vehicles such as fixed oils and ethyl oleate are also useful herein, as well as 
liposomes. 
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The carrier suitably contains minor amounts of additives such as substances that 
enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at the 
dosages and concentrations employed, and include buffers such as phosphate, citrate, 
succinate, acetic acid, and other organic acids or their salts; antioxidants such as ascorbic 
5 acid; low molecular weight (less than about ten residues) polypeptides, e. g., polyarginine 
or tripeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic 
polymers such as polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid, 
aspartic acid, or arginine; monosaccharides, disaccharides, and other carbohydrates 
including cellulose or its derivatives, glucose, manose, or dextrins; chelating agents such 

10 as EDTA; sugar alcohols such as mannitol or sorbitol; counterions such as sodium; and/or 
nonionic surfactants such as polysorbates, poloxamers, or PEG. 

The secreted polypeptide is typically formulated in such vehicles at a concentration 
of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of about 3 to 8. It will 
be understood that the use of certain of the foregoing excipients, carriers, or stabilizers 

1 5 will result in the formation of polypeptide salts. 

Any pol>'peptide to be used for therapeutic administration can be sterile. Sterility 
is readily accomplished by filtration through sterile filtration membranes (e.g., 0.2 micron 
membranes). Therapeutic polypeptide compositions generally are placed into a container 
having a sterile access port, for example, an intravenous solution bag or vial having a 

20 stopper pierceable by a hj^jodermic injection needle. 

Pol>'peptides ordinarily will be stored in unit or multi-dose containers, for 
example, sealed ampules or vials, as an aqueous solution or as a lyophilized formulation 
for reconstitution. As an example of a lyophilized formulation, 10-ml vials are filled with 
5 ml of sterile-filtered 1 % (w/v) aqueous polypeptide solution, and the resulting mixture 

25 is lyophilized. The infusion solution is prepared by reconstituting the lyophilized 
polypeptide using bacteriostatic Water-for-Injection. 

The invention also provides a pharmaceutical pack or kit comprising one or more 
containers filled with one or more of the ingredients of the pharmaceutical compositions 
of the invention. Associated with such container (s) can be a notice in the form prescribed 

30 by a governmental agency regulating the manufacture, use or sale of pharmaceuticals or 
biological products, which notice reflects approval by the agency of manufacture, use or 
sale for human administration. In addition, the polypeptides of the present invention may 
be employed in conjunction with other therapeutic compounds. 
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Example 9: Method of Treating Decreased Levels of the Polypeptide 

It will be appreciated that conditions caused by a decrease in the standard or 

normal expression level of a secreted protein in an individual can be treated by 

administering the polypeptide of the present invention, preferably in the secreted form. 
5 Thus, the invention also provides a method of treatment of an individual in need of an 

increased level of the polypeptide comprising administering to such an individual a 

pharmaceutical composition comprising an amount of the polypeptide to increase the 

activity level of the polypeptide in such an individual. 

For example, a patient with decreased levels of a polypeptide receives a daily dose 
10 0.1-100 ug/kg of the polypeptide for six consecutive days. Preferably, the polypeptide is 

in the secreted form. The exact details of the dosing scheme, based on administration and 

formulation, are provided above. 

Example 10: Method of Treating Increased Levels of the Polypeptide 

Antisense or RNAi technology are used to inhibit production of a polypeptide of 
1 5 the present invention. This technology is one example of a method of decreasing levels of 
a polypeptide, preferably a secreted form, due to a variety of etiologies, such as cancer 

For example, a patient diagnosed with abnormally increased levels of a 
polypeptide is administered intravenously antisense polynucleotides at 0.5, 1.0, 1.5, 2.0 
and 3.0 mg/kg day for 21 days. This treatment is repeated after a 7-day rest period if the 
20 treatment was well tolerated. The formulation of the antisense polynucleotide is provided 
above. 

Example 11: Method of Treatment Using Gene Therapy 

One method of gene therapy transplants fibroblasts, which are capable of 
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a 

25 subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and 
separated into small pieces. Small chunks of the tissue are placed on a wet surface of a 
tissue culture flask, approximately ten pieces are placed in each flask. The flask is turned 
upside down, closed tight and left at room temperature over night. After 24 hours at room 
temperature, the flask is inverted and the chunks of tissue remain fixed to the bottom of 

30 the flask and fresh media (e. g., Ham's Fl 2 media, with 1 0% FBS, penicillin and 

streptomycin) is added. The flasks are then incubated at 37^C for approximately one week. 
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At this time, fresh media is added and subsequently changed every several days. 
After an additional two weeks in culture, a monolayer of fibroblasts emerge. The 
monolayer is trypsinized and scaled into larger flasks, pMV-7 (Kirschmeier, P. T. et al., 
DNA, 7: 219-25 (1988)), flanked by the long terminal repeats of the Moloney murine 
5 sarcoma virus, is digested with EcoRI and Hindlll and subsequently treated with calf 
intestinal phosphatase. The linear vector is fractionated on agarose gel and purified, using 
glass beads. 

The cDNA encoding a polypeptide of the present invention can be amplified using 
PCR primers which correspond to the 5'and 3*end sequences respectively as set forth in 

10 Example 3. Preferably, the 5'primer contains an EcoRI site and the 3'primer includes a 
Hindlll site. Equal quantities of the Moloney murine sarcoma virus linear backbone and 
the amplified EcoRI and Hindlll fragment are added together, in the presence of T4 DNA 
ligase. The resulting mixture is maintained under conditions appropriate for ligation of the 
two fragments. The ligation mixture is then used to transform bacteria HB 101, which are 

1 5 then plated onto agar containing kanamycin for the purpose of confirming that the vector 
has the gene of interest properiy inserted. 

The amphotropic pA317 or GP+aml2 packaging cells are grown in tissue culture to 
confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 10% calf serum 
(CS), penicillin and streptomycin. The MSV vector containing the gene is then added to 

20 the media and the packaging cells transduced with the vector. The packaging cells now 
produce infectious viral particles containing the gene (the packaging cells are now referred 
to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequently, the media 
is harvested from a 10 cm plate of confluent producer cells. The spent media, containing 

25 the infectious viral particles, is filtered through a millipore filter to remove detached 
producer cells and this media is then used to infect fibroblast cells. Media is removed 
from a sub-confluent plate of fibroblasts and quickly replaced with the media from the 
producer cells. This media is removed and replaced with fresh media. 

If the titer of virus is high, then virtually all fibroblasts will be infected and no 

30 selection is required. If the titer is very low, then it is necessary to use a retroviral vector 
that has a selectable marker, such as neo or his. Once the fibroblasts have been efficiently 
infected, the fibroblasts are analyzed to determine whether protein is produced. 
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The engineered fibroblasts are then transplanted onto the host, either alone or after 
having been grown to confluence on cytodex 3 microcarrier beads. 

Example 12: Method of Treatment Using Gene Tberapy-In Vivo 

Another aspect of the present invention is using zn vivo gene therapy methods to 
5 treat disorders, diseases and conditions. The gene therapy method relates to the 

introduction of naked nucleic acid (DNA, RN A, and antisense DNA or RNA) sequences 
into an animal to increase or decrease the expression of the polypeptide. 

The polynucleotide of the present invention may be operatively linked to a 
promoter or any other genetic elements necessary for the expression of the polypeptide by 

10 the target tissue. Such gene therapy and delivery techniques and methods are known in 
the art, see, for example, Tabata H. et al Cardiovasc. Res, 35 (3): 470-479 (1997); Chao J 
ef aL Pharmacol. Res. 35 (6): 517-522 (1997); Wolff J. A. Neuromuscul Disord. 1 (5): 
314-318 (1997), Schwartz B. et al Gene Tlier. 3 (5): 405-41 1 (1996); and Tsurumi Y. ei 
al. Circulatiou 94 (12): 3281-3290 (1996); WO 90/1 1092, WO 98/1 1 779; U. S. Patent No. 

15 5,693,622; 5,705,151 ; 5,580,859, the contents of which are hereby incorporated by 
reference herein in their entirety. 

The polynucleotide constructs may be delivered by any method that delivers 
injectable materials to the cells of an animal, such as, injection into the interstitial space of 
tissues (heart, muscle, skin, colon, liver, intestine and the like). The polynucleotide 

20 constructs can be delivered in a pharmaceutically acceptable liquid or aqueous carrier. 

The term "naked" polynucleotide, DNA or RNA, refers to sequences that are free 
from any delivery vehicle that acts to assist, promote, or facilitate entry into the cell, 
including viral sequences, viral particles, liposome formulations, lipofectin or precipitating 
agents and the like. However, the polynucleotides of the present invention may also be 

25 delivered in liposome formulations (such as those taught in Feigner P. L. et al Ann. NY 
Acad. ScL 111: 126-139 (1995) and Abdallah B. et al Biol Cell 85 (1): 1-7 (1995)) which 
can be prepared by methods well known to those skilled in the art. 

The polynucleotide vector constructs used in the gene therapy method are 
preferably constructs that will not integrate into the host genome nor will they contain 

30 sequences that allow for replication. Any strong promoter known to those skilled in the art 
can be used for driving the expression of DNA. Unlike other gene therapies techniques, 
one major advantage of introducing naked nucleic acid sequences into target cells is the 
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transitory nature of the polynucleotide synthesis in the cells. Studies have sho\vn that non- 
replicating DNA sequences can be introduced into cells to provide production of the 
desired polypeptide for periods of up to six months. 

The polynucleotide construct can be delivered to the interstitial space of tissues 
5 within the an animal, including of muscle, skin, brain, colon, liver, spleen, bone marrow, 
thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, stomach, 
intestine, testis, ovary, utenis. rectum, nervous system, eye, gland, and connective tissue. 
Interstitial space of the tissues comprises the intercellular fluid, mucopolysaccharide 
matrix among the reticular fibers of organ tissues, elastic fibers in the walls of vessels or 
1 0 chambers, collagen fibers of fibrous tissues, or that same matrix within connective tissue 
ensheathing muscle cells or in the lacunae of bone. It is similarly the space occupied by 
the plasma of the circulation and the bmph fluid of the lymphatic channels. Delivery to 
the interstitial space of muscle tissue is preferred for the reasons discussed below. They 
may be conveniently delivered by injection into the tissues comprising these cells. They 
1 5 are preferably delivered to and expressed in persistent, non-dividing cells which are 

differentiated, although delivery and expression may be achieved in non-differentiated or 
less completely differentiated cells, such as, for example, stem cells of blood or skin 
fibroblasts. In vivo muscle cells are particularly competem in their ability to take up and 
express polynucleotides. 
20 For the naked polynucleotide injection, an effective dosage amount of DNA or 

RNA will be in the range of from about 0.05 ng/kg body weight to about 50 mg/kg body 
weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 mg/kg and 
more preferably from about 0.05 mgAcg to about 5 mg/kg. Of course, as the artisan of 
ordinary skill will appreciate, this dosage will vary according to the tissue site of injection. 
The appropriate and effective dosage of nucleic acid sequence can readily be determined 
by those of ordinary skill in the art and may depend on the condition being treated and the 
route of administration. The preferred route of administration is by the parenteral route of 
injection into the interstitial space of tissues. However, other parenteral routes may also 
be used, such as, inhalation of an aerosol formulation particularly for delivery to colons or 
30 bronchial tissues, throat or mucous membranes of the nose. In addition, naked 

polynucleotide constructs can be delivered to arteries during angioplasty by the catheter 
used in the procedure. 



25 
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The dose response effects of injected polynucleotide in muscle in vivo is 
determined as follows. Suitable template DNA for production of mRNA coding for 
polypeptide of the present invention is prepared in accordance with a standard 
recombinant DNA methodology. The template DNA, which may be either circular or 
5 linear, is either used as naked DNA or complexed with liposomes. The quadriceps 
muscles of mice are then injected with various amounts of the template DNA. 

Five to six week old female and male Balb/C mice are anesthetized by 
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made on the 
anterior thigh, and the quadriceps muscle is directly visualized. The template DNA is 
10 injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge needle over one minute, 
approximately 0.5 cm from the distal insertion site of the muscle into the knee and about 
0.2 cm deep. A suture is placed over the injection site for future localization, and the skin 
is closed with stainless steel clips. 

After an appropriate incubation time (e.g., 7 days) muscle extracts are prepared by 
15 excising the entire quadriceps. Every fifth 15 um cross-section of the individual 

quadriceps muscles is histochemically stained for protein expression. A time course for 
protein expression may be done in a similar fashion except that quadriceps from different 
mice are harvested at different times. Persistence of DNA in muscle following injection 
may be determined by Southern blot analysis after preparing total cellular DNA and HIRT 
20 supematants from injected and control mice. 

The results of the above experimentation in mice can be use to extrapolate proper 
dosages and other treatment parameters in humans and other animals using naked DNA. 

Example 13: Transgenic Animals 

The polypeptides of the invention can also be expressed in transgenic animals. 

25 Animals of any species, including, but not limited to, mice, rats, rabbits, hamsters, guinea 
pigs, pigs, micro-pigs, goats, sheep, cows and non-human primates, e. g., baboons, 
monkeys, and chimpanzees may be used to generate transgenic animals. In a specific 
embodiment, techniques described herein or otherwise known in the art, are used to 
express polypeptides of the invention in humans, as part of a gene therapy protocol. 

30 Any technique known in the art may be used to introduce the transgene (1. e., 

polynucleotides of the invention) into animals to produce the founder lines of transgenic 
animals. Such techniques include, but are not limited to, pronuclear microinjection 
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(Paterson et al., Appi Microbiol BiotechwL 40: 691-698 (1994); Carver et al, 
Biotechiology 11: 1263-1270 (1993); Wright et al., Biotechnology 9: 830-834 (1991); and 
U. S. Pat. No. 4,873,191, the contents of which is hereby incorporated by reference herein 
in its entirety); retrovirus mediated gene transfer into germ lines (Van der Putten et al., 
5 Proc. Natl Acad, ScL USA 82: 6148-6152 (1985)), blastocysts or embryos; gene targeting 
in embryonic stem cells (Thompson et al.. Cell 56: 313-321 (1989)); electroporation of 
cells or embryos (Lo, \9S3,MolCell Biol 3: 1803-1814 (1983)); introduction of the 
polynucleotides of the invention using a gene gun (see, e, g., Ulmer et al., Science 259: 
1745 (1993); introducing nucleic acid constructs into embryonic pleuripotent stem cells 

10 and transferring the stem cells back into the blastocyst; and sperm mediated gene transfer 
(Lavitrano et al.. Cell 57: 717-723 (1989). For a review of such techniques, see Gordon, 
"Transgenic Animals," //;//. Rev. Cytol 115: 171-229 (1989). 

Any technique known in the art may be used to produce transgenic clones 
containing polynucleotides of the invention, for example, nuclear transfer into enucleated 

15 oocjies of nuclei from cultured embryonic, fetal, or aduU cells induced to quiescence 
(Campell et al. Nature 380: 64-66 (1996); Wilmut et al. Nature 385: 810813 (1997)). 

The present invention provides for transgenic animals that carry the transgene in 
all their cells, as well as animals which carry the transgene in some, but not all their cells, 
I. e., mosaic animals or chimeric. The transgene may be integrated as a single transgene 

20 or as multiple copies such as in concatamers, e.g., head-to-head tandems or head-to-tail 
tandems. The transgene may also be selectively introduced into and activated in a 
particular cell type by following, for example, the leaching of Lasko et al. (Lasko et al.. 
Proa, Natl Acad. ScL USA 89: 6232-6236 (1992)), The regulatory sequences required for 
such a cell-type specific activation will depend upon the particular cell type of interest, 

25 and will be apparent to those of skill in the art. When it is desired that the polynucleotide 
transgene be integrated into the chromosomal site of the endogenous gene, gene targeting 
is preferred. Briefly, when such a technique is to be utilized, vectors containing some 
nucleotide sequences homologous to the endogenous gene are designed for the purpose of 
integrating, via homologous recombination with chromosomal sequences, into and 

30 disrupting the function of the nucleotide sequence of the endogenous gene. The transgene 
may also be selectively introduced into a particular cell type, thus inactivating the 
endogenous gene in only that cell type, by following, for example, the teaching of Gu et 
al. (Gu et al.. Science 265: 103-106 (1994)). The regulatory sequences required for such a 
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cell-type specific inactivation will depend upon the particular cell type of interest, and will 
be apparent to those of skill in the art. 

Once transgenic animals have been generated, the expression of the recombinant 
gene may be assayed utilizing standard techniques. Initial screening may be accomplished 
5 by Southern blot analysis or PCR techniques to analyze animal tissues to verify that 
integration of the transgene has taken place. The level of mRNA expression of the 
transgene in the tissues of the transgenic animals may also be assessed using techniques 
which include, but are not limited to, Northern blot analysis of tissue samples obtained 
from the animal, in situ hybridization analysis, and reverse transcriptase-PCR (rt-PCR). 
1 0 Samples of transgenic gene-expressing tissue may also be evaluated 

immunocytochemically or immunohistochemically using antibodies specific for the 
transgene product. 

Once the founder animals are produced, they may be bred, inbred, outbred, or 
crossbred to produce colonies of the particular animal Examples of such breeding 

1 S strategies include, but are not limited to: outbreeding of founder animals with more than 
one integration site in order to establish separate lines; inbreeding of separate lines in 
order to produce compound transgenics that express the transgene at higher levels because 
of the effects of additive expression of each transgene; crossing of heterozygous 
transgenic animals to produce animals homozygous for a given integration site in order to 

20 both augment expression and eliminate the need for screening of animals by DNA 

analysis; crossing of separate homozygous lines to produce compound heterozygous or 
homozygous lines; and breeding to place the transgene on a distinct background that is 
appropriate for an experimental model of interest. 

Transgenic animals of the invention have uses which include, but are not limited 

25 to, animal model systems useful in elaborating the biological function of polypeptides of 
the present invention, studying conditions and/or disorders associated with aberrant 
expression, and in screening for compounds effective in ameliorating such conditions 
and/or disorders. 

Example 14: Knock-Out Animals 
30 Endogenous gene expression can also be reduced by inactivating or "knocking out" 

the gene and/or its promoter using targeted homologous recombination. (E. g., see 
Smithies et aL, Nature 317: 230-234 (1985); Thomas & Capecchi, Cell 51 : 503512 (1987); 



wo 2004/050860 



PCTAJS2003/040063 



235 

Thompson et al., Cell 5: 313-321 (1989)) Altematively, RNAi technology may be used. 
For example, a mutant, non-functional polynucleotide of the invention (or a completely 
unrelated DNA sequence) flanked by DNA homologous to the endogenous polynucleotide 
sequence (either the coding regions or regulatory regions of the gene) can be used, with or 
5 without a selectable marker and/or a negative selectable marker, to transfect cells that 
express polypeptides of the invention in vivo. In another embodiment, techniques known 
in the art are used to generate knockouts in cells that contain, but do not express the gene 
of interest. Insertion of the DNA construct, via targeted homologous recombination, 
results in inactivation of the targeted gene. Such approaches are particularly suited in 

10 research and agricultural fields where modifications to embryonic stem cells can be used 
to generate animal offspring with an inactive targeted gene (e. g., see Thomas & Capecchi 
1987 and Thompson 1989, supra). However, this approach can be routinely adapted for 
use in humans provided the recombinant DNA constructs are directly administered or 
targeted to the required site in vivo using appropriate viral vectors that will be apparent to 

1 5 those of skill in the art. 

In further embodiments of the invention, cells that are genetically engineered to 
express the polypeptides of the invention, or alternatively, that are genetically engineered 
not to express the polypeptides of the invention (e. g., knockouts) are administered to a 
patient in vivo. Such cells may be obtained from the patient (i.e., animal, including 

20 human) or an MHC compatible donor and can include, but are not limited to fibroblasts, 
bone marrow cells, blood cells (e. g., lymphocytes), adipocytes, muscle cells, endothelial 
cells etc. The cells are genetically engineered in vitro using recombinant DNA techniques 
to introduce the coding sequence of polypeptides of the invention into the cells, or 
altematively, to disrupt the coding sequence and/or endogenous regulatory sequence 

25 associated with the polypeptides of the invention, e.g., by transduction (using viral 
vectors, and preferably vectors that integrate the transgene into the cell genome) or 
transfection procedures, including, but not limited to, the use of plasmids, cosmids, YACs, 
naked DNA, electroporation, liposomes, etc. 

The coding sequence of the polypeptides of the invention can be placed under the 

30 . control of a strong constitutive or inducible promoter or promoter/enhancer to achieve 

expression, and preferably secretion, of the polypeptides of the invention. The engineered 
cells which express and preferably secrete the polypeptides of the invention can be 
introduced into the patient systemically, e. g., in the circulation, or intraperitoneally. 
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Alternatively, the cells can be incorporated into a matrix and implanted in the 
body, e. g., genetically engineered fibroblasts can be implanted as part of a skin graft; 
genetically engineered endothelial cells can be implanted as part of a Ijonphatic or 
vascular graft. (See, for example, Anderson et al. U. S. Patent No. 5,399,349; and 
5 Mulligan & Wilson, U. S. Patent No. 5,460,959, the contents of which are hereby 
incorporated by reference herein in their entirety). 

When the cells to be administered are non-autologous or non-MHC compatible 
cells, they can be administered using well known techniques which prevent the 
development of a host immune response against the introduced cells. For example, the 
10 cells may be introduced in an encapsulated form which, while allowing for an exchange of 
components with the immediate extracellular environment, does not allow the introduced 
cells to be recognized by the host immune system. 

Transgenic and "knock-out" animals of the invention have uses which include, but 
are not limited to, animal model systems useful in elaborating the biological ftinction of 
1 5 polypeptides of the present invention, studying conditions and/or disorders associated with 
aberrant expression, and in screening for compounds effective in ameliorating such 
conditions and/or disorders. 

While preferred illustrative embodiments of the present invention are described, 
one skilled in the art will appreciate that the present invention can be practiced by other 
20 than the described embodiments, which are presented for purposes of illustration only and 
not by way of limitation. The present invention is limited only by the claims that follow. 
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We claim: 

1. An isolated nucleic acid molecule comprising: 

(a) a nucleic acid molecule comprising a nucleic acid sequence that encodes an 
amino acid sequence of SEQ ID NO: 96-237; 
5 (b) a nucleic acid molecule comprising a nucleic acid sequence of SEQ ID NO: 

1-95; 

(c) a nucleic acid molecule that selectively hybridizes to the nucleic acid 
molecule of (a) or (b); or 

(d) a nucleic acid molecule having at least 95% sequence identity to the nucleic 
10 acid molecule of (a) or (b). 

2. The nucleic acid molecule according to claim 1» wherein the nucleic acid molecule 
is a cDNA. 

15 3. The nucleic acid molecule according to claim 1 , wherein the nucleic acid molecule 
is genomic DNA. 

4. The nucleic acid molecule according to claim 1 , wherein the nucleic acid molecule 
is an RNA. 

20 

5. The nucleic acid molecule according to claim 1 , wherein the nucleic acid molecule 
is a mammalian nucleic acid molecule. 

6. The nucleic acid molecule according to claim 5, wherein the nucleic acid molecule 
25 is a human nucleic acid molecule. 

7. A method for determining the presence of a colon specific nucleic acid (CSNA) in 
a sample, comprising the steps of: 

(a) contacting the sample with the nucleic acid molecule of SEQ ID NO: 1-95 
30 under conditions in which the nucleic acid molecule will selectively hybridize to a 

colon specific nucleic acid; and 
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(b) detecting hybridization of the nucleic acid molecule to a CSNA in the 
sample, wherein the detection of the hybridization indicates the presence of a 
CSNA in the sample. 



5 8. A vector comprising the nucleic acid molecule of claim 1 . 



9. A host cell comprising the vector according to claim 8. 



10. A method for producing a polypeptide encoded by the nucleic acid molecule 
10 according to claim 1, comprising the steps of: 

(a) providing a host cell comprising the nucleic acid molecule operably linked 
to one or more expression control sequences, and 

(b) incubating the host cell under conditions in which the polypeptide is 
produced. 

15 

11. A polypeptide encoded by the nucleic acid molecule according to claim 1 . 



12. An isolated pol>T3eptide selected from the group consisting of: 

(a) a polypeptide comprising an amino acid sequence with at least 95% 
20 sequence identity to of SEQ ID NO: 96-237 ; or 

(b) a polypeptide comprising an amino acid sequence encoded by a nucleic 
acid molecule having at least 95% sequence identity to a nucleic acid molecule 
comprising a nucleic acid sequence of SEQ ID NO: 1-95. 



25 13. An antibody or fragment thereof that specifically binds to: 

(a) a polypeptide comprising an amino acid sequence with at least 95% 
sequence identity to of SEQ ID NO: 96-237 ; or 

(b) a pol>'peptide comprising an amino acid sequence encoded by a nucleic 
acid molecule having at least 95% sequence identity to a nucleic acid molecule 

30 comprising a nucleic acid sequence of SEQ ID NO: 1-95. 

14. A method for determining the presence of a colon specific protein in a sample, 
comprising the steps of: 
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(a) contacting the sample with a suitable reagent under conditions in which the 
reagent will selectively interact with the colon specific protein comprising an 
amino acid sequence with at least 95% sequence identity to of SEQ ID NO: 96- 
237; and 

5 (b) detecting the interaction of the reagent with a colon specific protein in the 

sample, wherein the detection of binding indicates the presence of a colon specific 
protein in the sample. 



15. A method for diagnosing or monitoring the presence and metastases of colon 
1 0 cancer in a patient, comprising the steps of: 

(a) determining an amount of: 

(i) a nucleic acid molecule comprising a nucleic acid sequence that 
encodes an amino acid sequence of SEQ ID NO: 96-237; 

(ii) a nucleic acid molecule comprising a nucleic acid sequence of SEQ 
15 ID NO: 1-95; 

(iii) a nucleic acid molecule that selectively hybridizes to the nucleic 
acid molecule of (i) or (ii); 

(iv) a nucleic acid molecule having at least 95% sequence identity to the 
nucleic acid molecule of (i) or (ii); 

20 (v) a polypeptide comprising an amino acid sequence with at least 95% 

sequence identity to of SEQ ID NO: 96-237 ; or 
(vi) a polypeptide comprising an amino acid sequence encoded by a 
nucleic acid molecule having at least 95% sequence identity to a nucleic 
acid molecule comprising a nucleic acid sequence of SEQ ID NO: 1-95 

25 and; 

(b) comparing the amount of the determined nucleic acid molecule or the 
polypeptide in the sample of the patient to the amount of the colon specific marker 
in a normal control; wherein a difference in the amount of the nucleic acid 
molecule or the polypeptide in the sample compared to the amount of the nucleic 

30 acid molecule or the polypeptide in the normal control is associated with the 

presence of colon cancer. 
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1 6. A kit for detecting a risk of cancer or presence of cancer in a patient, said kit 
comprising a means for determining the presence of: 

(a) a nucleic acid molecule comprising a nucleic acid sequence that encodes an 
amino acid sequence of SEQ ID NO: 96-237; 
5 (b) a nucleic acid molecule comprising a nucleic acid sequence of SEQ ID NO: 

1-95; 

(c) a nucleic acid molecule that selectively hybridizes to the nucleic acid 
molecule of (a) or (b); or 

(d) a nucleic acid molecule having at least 95% sequence identity to the nucleic 
10 acid molecule of (a) or (b); or 

(e) a polypeptide comprising an amino acid sequence with at least 95% 
sequence identity to of SEQ ID NO: 96-237 ; or 

(f) a polypeptide comprising an amino acid sequence encoded by a nucleic 
acid molecule having at least 95% sequence identity to a nucleic acid molecule 

1 5 comprising a nucleic acid sequence of SEQ ID NO: 1 -95. 

17. A method of treating a patient with colon cancer, comprising the step of 
administering a composition consisting of 

(a) a nucleic acid molecule comprising a nucleic acid sequence that encodes an 
20 amino acid sequence of SEQ ID NO: 96-237; 

(b) a nucleic acid molecule comprising a nucleic acid sequence of SEQ ID NO: 
1-95; 

(c) a nucleic acid molecule that selectively hybridizes to the nucleic acid 
molecule of (a) or (b); 

25 (d) a nucleic acid molecule having at least 95% sequence identity to the nucleic 

acid molecule of (a) or (b); 

(e) a polypeptide comprising an amino acid sequence with at least 95% 
sequence identity to of SEQ ID NO: 96-237 ; or 

(f) a polypeptide comprising an amino acid sequence encoded by a nucleic 
30 acid molecule having at least 95% sequence identity to a nucleic acid molecule 

comprising a nucleic acid sequence of SEQ ID NO: 1-95; 
to a patient in need thereof, wherein said administration induces an immune response 
against the colon cancer cell expressing the nucleic acid molecule or polypeptide. 
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1 8. A vaccine comprising the polypeptide or the nucleic acid encoding the polypeptide 
of claim 12. 
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1 
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<110> diaDexus, Inc. 

Macina, Roberto 
Turner, Leah 
Sun, Yongming 

<120> Compositions, Splice Variants and Methods Relating to Colon 
Specific Genes and Proteins 

<130> DEX-0448 

<150> US 60/431,132 
<151> 2002-12-04 

<150> US 60/431, 144 
<151> 2002-12-04 

<160> 240 

<170> Patentin version 3.1 

<210> 1 

<211> 547 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (103) . . (103) 

<223> n=a, c, g or t 



<220> 

<221> misc_feature 

<222> (112) . . (112) 

<223> n=a, c, g or t 



<220> 

<221> mi6C_feature 

<222> (208) . . (208) 

<223> nssa, c, g or t 



<220> 

<221> misc_feature 

<222> (243) . . (243) 

<223> n=sa, c, g or t 



<220> 

<221> misc_feature 

<222> (277) . . (277) 

<223> nsa, c, g or t 



<400> 1 

cagaagagta ttcacgcctg caatgtaggt ggccgactcc tgtgtcagga caggccacct 



60 



acattgcaga agagtattca cgcctgcgca gccaggatag cantgagctc angacaccgg 120 
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ccagggacat tctcgcgagt tacagccctg aacgacgtgg aaactcgtga ttcgacgtgg 180 

ccacatgccc gatgcgaagg ccctgcantg agcagagacg tgtggacccc tgcaggatgc 240 

agncaggaag ctgtggagtt ggtccagtat gcctacnttt agagaaggtc cgtggagaac 300 

ggagaagaaa cgtaagaagc gaagtgamgg atkwastcar rgacagaaga tgaagaggag 360 

aaaagccaag aggaycmkgr agcgaagagg aagagaagga agactcgcca gccagatgcc 4 20 

aaagatgggg attcatacga mccctatgac ttcagtgaca cagaggagga aatgcctcaa 480 

gtacacactc caaagacggc agactcacag gagaccaagg aatcccagaa agtggagttg 540 

agcgaat 547 

<210> 2 

<211> 325 

<212> DNA 

<213> Homo sapien 

<400> 2 

ggcctggatt aatacccctc gaagacaagg aggactatgg gccgaataag gagtgtccac 60 

tgtgcctttg tccaagattg tttgaatccc taagccgaga tctcaaaagg actatggtgt 120 

atacctagag gactcaggcc acactcttag aggtctcttc attattgatg acaaaggaat 180 

cctaagacaa attactctga acgatcttcc tgtgggtaga tcagtggatg agacactacg 24 0 

tttggttcaa gcattccagt acactgacaa acacggagaa gtctgccctg ccggctggaa 300 

acctggtaag tgaaacaata atccc 325 



<210> 3 

<211> 1165 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_f eature 

<222> (44) . . (44) 

<223> n-a, c, g or t 



<220> 

<221> misc_feature 

<222> (50) , . (50) 

<223> n=a, c, g or t 



<220> 

<221> inisc_f eature 

<222> (57).. (58) 

<223> n=a, c, g or t 



<220> 
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<221> misc_f eature 
<222> (60).. (61) 
<223> n=a, c, g or t 



<220> 

<221> misc_f eature 

<222> (66).. (66) 

<223> n«a, c, g or t 



<220> 

<221> misc_feature 

<222> (68).. (68) 

<223> n=a, c, g or t 



<220> 

<221> misc_f eature 

<222> (70).. (74) 

<223> n=a, c, g or t 



<220> 

<221> misc_f eature 

<222> (76).. (76) 

<223> n=a, c, g or t 



<220> 

<221> misc_f eature 

<222> (78).. (78) 

<223> n=a, c, g or t 



<220> 

<221> misc_f eature 

<222> (80) . , (81) 

<223> n=a, c, g or t 



<220> 

<221> mi sc_f eature 

<222> (83).. (84) 

<223> n=a, c, g or t 



<220> 

<221> misc_f eature 

<222> (87).. (87) 

<223> n=a, c, g or t 



<220> 

<221> misc_feature 

<222> (95) . . (95) 

<223> n=a, c, g or t 



<220> 

<221> misc feature 
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<222> (98) . . (98) 

<223> n=a, c, g or t 

<220> 

<221> misc_feature 

<222> (lOlT. . (101) 

<223> nsa, c, g or t 



<220> 

<221> misc_feature 

<222> (104) , . (104) 

<223> n=a, c, g or t 



<220> 

<221> misc_f eature 

<222> (110) , , (111) 

<223> n=a, c, g or t 



<4C0> 3 
aaccgctccg 


gcaacgccaa 


ccgctccgct 


Qcgcgcaqqc 


tggnctgcan 


gctctcnncn 


60 


ncagcncncn 


nnnnancnan 


nannccnctt 


ccaanatntc 


ncanccctcn 


ncctccctct 


120 


gctgcctgct 


ggtgttggcc 


aatgcccgga 


gcaggccctc 


tttccatccc 


ctgtcggatg 


180 


agctggtcaa 


ctatgtsaac 


aaacggaata 


ccacgtggca 


ggccgggcac 


aacttctaca 


24 0 


acgtggacat 


gagctacttg 


aagaggctat 


gtggtacctt 


cctgggtggg 


cccaagccac 


300 


cccagagagt 


tatgtttacc 


gaggacctga 


agctgcctgc 


aagcttcgat 


gcacgggaac 


360 


aatggccaca 


gtgtcccacc 


atcaaagaga 


tcagagacca 


gggctcctgt 


ggctcctgct 


420 


gggccttcgg 


ggctgtggaa 


gccatctctg 


accggatctg 


satycacacc 


aatgcgcacg 


480 


tcgagcgtgg 


aggtgtcggc 


ggaggacctg 


ctcacrcgct 


gtggcagcat 


gtgtggggac 


54 0 


ggctgtaatg 


gtggctatcc 


tgctgaagct 


tggaacttct 


ggacaaga.aa 


aggcctggtt 


600 


tctggtggcc 


atctatgaat 


cccatgtagg 


gtgcagaccg 


taytccatac 


cctccctgtg 


660 


agcaccacgt 


cgaaggggtc 


ccggbcccca 


tgcacggggg 


agggagatac 


ccscargtgt 


720 


agcaagalcct 


gtgagcctgg 


gtacagaccg 


actayaaacw 


ggacamgcgy 


tayggataca 


780 


attctacagg 


tstccaatag 


cgaggacrtc 


atggcygaga 


cctmaaaaaa 


cggcccgtgg 


840 


agggagcttt 


ctctgtgtat 


cggacttcct 


ggtctagaag 


tcagggagtg 


taccaacacg 


900 


tcaccggaga 


gatgattggg 


tgggcatgcc 


atccggatcc 


tgggctgggg 


agtggcgaag 


960 


gacaccctac 


tggtggttgc 


atcggaaaca 


ctgactgggg 


tgacatgggt 


cttaaatacc 


1020 


cgaggcagga 


cacgtgggac 


cgcacacatg 


gggggggaac 


cggccgagca 


gcggaaaaat 


1080 


aatgcgggcg 


tggcactgcg 


gccacaggaa 


aacgtaatca 


gcacacaaaa 


agaggaggca 


1140 
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ggaacaaccc ggaaactgac gtgga 



<210> 4 

<211> 754 

<212> DNA 

<213> Homo sapien 

<220> 

<22l> misc_f eature 

<222> (29)7.(29) 

<223> n=a, c, g or t 



1165 



<220> 

<221> misc_feature 

<222> (135) . . (135) 

<223> n=a, c, g or t 



<220> 

<221> misc_feature 

<222> (176) . . (176) 

<223> n=a, c, g or t 



<220> 

<221> misc_f eature 

<222> (235T. . (235) 

<223> n=a, c, g or t 



<220> 

<221> misc_feature 

<222> (256) . . (256) 

<223> n=a, c, g or t 



<400> 4 

tcacctccct ccacgaggac gattggcgnt ctcggccttc ccggggccct gccctcaccc 60 

cgatccggga tgaggagtgg ggtggccact ccccccggag tcccagggga tgggaccagg 120 

agcccgccag ggagnaggca ggcgggggct ggcgggccag gcggccccgg gcccgntccg 180 

atagacgcca ctggacgacc tcaccccgcc gagcaccgca cgagtcaggg agcanctctc 240 

ccacgaataa tggtgngaga agccgggcac tacatgcccc cgcagagccc cagccgggac 300 

gacctctatg accaagacaa ctctagggac attcccacgc taccacaggc gacccccatc 360 

tacgacaaca ttcaggctcc tcgggagcgc cctcctgcct accccaggtc ccaccaccac 420 

cgtacccggg accctcggga caacggctcc aggtccgggg acctccccta tgatgggcgg 480 

ctactggagg aggctgtgag gaagaaaggg gtcggaggag aggaggatac cccacaagga 540 

ggaggaggaa gaggcctact acccgcccgc gccgcccccg tactcggaga ccgactcgca 600 

ggcgtcccga gagcgcaggc tcaagaagaa cttggccctg agtcgggaaa gtttagtcgt 660 
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ctgatctgac gttttctacg tagctttygt attttttwty ytaatttgaa ggcacactga 720 

tgaagcatac cgtacccctc cgagttataa cgtc 754 

<210> 5 

<211> 1269 

<212> DNA 

<213> Homo sapien 

<400> 5 

cagcccgcgc gggtactgtg tgagtctgcg tctggtggga cggggacggg cgcggtggaa 60 

ggttgatagg cgctgtgggt ttgagcttgg tgcgttgtgc gcgttgaggg ggttatgttg 120 

Ctcggtgtcc tgtgtgtttg tatatgggat agtgcgtgct tggctgtgct gaggtgtgtg 180 

ctcgggtgtg ttattggcgt ggtcagatgc gtcttggagt tttcgtgagg cgcctttgtg 24 0 

tgtccccggg cgcggggatg acgttgtgtt ggtcgtggtg tgcttgtggg agccacacgt 3 00 

gggaactgcg gttggaaaat attatagaag ggctaaatgt gggggcccat cctcccttga 360 

cgggatttgt atgatgtcta gtgagggacg cgacgtatgt gggggcttgc gtttcctgtc 420 

gtgcatctga gagtgtacgg tgttatggcg tgcactatgc cgtttctcaa aaaagtttcc 480 

ttgctgaggc atagcatcag gagtctgagg tcgctctttg gtcattaccc ttgggaatat 540 

taagtgtgtt catcaggctg gtgttgttag tcagcaaaca gctgtgttaa taattagagt 600 

actggttctt aagtgatatc agttattgtt agggtatgat gttaagacta ttgatttcac 660 

ctatacagtg aggatatagg tttataagag atacttcatc cagcacaccc ttactctgcc 720 

agagtagtga agctaattaa acacgttcgg tttctgaata aattgaacta aatccaaact 780 

atttcctaaa atcacaggac attaaggacc aatagcatct gtgccagaga tgtactgtta 84 0 

ttagctggga agaccaattc taacagcaaa taacagtctg agactcctca tacctcagtg 900 

gttagaagca tgtctctctt gagctacagt agaggggaag ggattgttgt gtagtcaagt 960 

caccatgctg aatgtacact gattccttta tgatgactgc ttaactcccc actgcctgtc 1020 

ccagagaggc tttccaatgt agctcagtaa ttcctgttac tttacagaca ggaaagttcc 1080 

agaaacttta agaacaaact ccgaaaacct atgagcaaat ggtgctgaat acwttttttt 1140 

taaagccaca tttcattgtc ttagtcaaag caggmttatt aastgattat ttaaaattcr 1200 

wywtkwtawa ttagcaactt caagtataac awcttwkaaa ctggaataag tgtttatttt 1260 

ctattaata 1269 



<210> 6 

<211> 1403 

<212> DMA 

<213> Homo sapien 
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<400> 6 
cagcccgcgc 


gggtactgtg 


tgagt ctgcg 


ggtcgatagg 


cgctgtgggt 


ctgagcctgg 


ttcggtgtcc 


tgtgtgtttg 


tatatgggat 


ctcgggtgtg 


ttattggcgt 


ggtcagatgc 


tgtccccggg 


cgcggggatg 


acgttgtgtt 


gggaactgcg 


gttggaaaat 


attatagaag 


cgggatttgt 


atgatgtcta 


gtgagggacg 


gtgcatctga 


gagtgtacgg 


tgttatggcg 


ttgttgaggc 


atagcatcag 


gagcctgagg 


taagtgtgtt 


catcaggctg 


gtgttgttag 


actggttctt 


aagtgatatc 


agttattgtt 


ctatacagtg 


aggatatagg 


tttataagag 


agagtagtga 


agctaattaa 


acacgtttgg 


atttcctaaa 


atcacaggac 


attaaggacc 


ttagctggga 


agaccaattc 


taacagcaaa 


gttagaagca 


tgtctctctt 


gagctacagt 


gagacatgct 


tctaaccact 


gaggtatgag 


aagggattgt 


tgtgtagtca 


agtcaccatg 


tgcttaactc 


cccactgcct 


gtcccagaga 


Cactttacag 


acaggaaagt 


tccagaaact 


caccatttgc 


tcataggtct 


ttcagagttt 


gaatacwttt 


tttttaaagc 


cacatttcat 


ttatttaaaa 


ttcrwywtkw 


tawattagca 


taagtgttta 


ttttctatta 


ata 


<210> 7 

<211> 200 

<212> DNA 

<213> Homo sapien 




<400> 7 
ccaccggcag 


gttgtactca 


ccacctgaat 


ccaccaagca 


gttcagaaac 


ctaccaccag 


cttctcagaa 


aaccaccaca 


aaaaccacca 
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tctggtggga 


cggggacggg 


cgcggtggaa 


60 


tgcgttgtgc 


gcgttgaggg 


ggttatgttg 


120 


agtgtgtgct 


tggctgtgct 


gaggtgtgtg 


180 


gtcccggagt 


tttcgtgagg 


cgcctttgtg 


240 


ggtcgtggtg 


tgcttgtggg 


agccacacgt 


300 


ggctaaatgt 


99g99cccat 


cctcccttga 


360 


cgacgtatgt 


gggggcttgc 


gtttcctgtc 


420 


tgcactatgc 


cgtttctcaa 


aaaagtttcc 


480 


tcgctctttg 


gtcattaccc 


ttgggaatat 


540 


ttagtaaaca 


gctgtgttaa 


taattagagt 


600 


agggtatgat 


gctaagacta 


ttgatttcac 


660 


atacttcacc 


cagcataccc 


ttactctgcc 


720 


tttctgaata 


aattgaacta 


aatccaaact 


780 


aatagcatct 


gtgccagaga 


tgtactgtta 


840 


taacagtctg 


agactcctca 


tacctcagtg 


900 


agaggggaag 


99attgttgt 


gtagtcaaga 


960 


gcatgtctct 


cttgagctac 


agtagagggg 


1020 


ctgaatgtac 


actgattcct 


tcacgatgac 


1080 


ggctttccaa 


tgtagctcag 


taattcctgt 


1140 


ttaagaacaa 


actctgaaag 


acctattcag 


1200 


gttcttaaag 


acctatgagc 


aaatggtgct 


1260 


tgtcttagtc 


aaagcaggmt 


tattaastga 


1320 


acttcaagta 


taacawcttw 


kaaactggaa 


1380 








1403 


gcagaggaaa 


atctctaact 


tccaaaggtc 


60 


taaatgttcc 


aactacagaa 


gtctcaccaa 


120 


caccaaatgc 


tcawgcwaca 


cggagtacac 


180 
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ccgccaggga 


tcccctagaa 












<210> 8 

<211> 983 

<212> DNA 

<213> Homo sapien 












<400> 8 
ctctgggccc 


tgtgttttcc 


cgagcacctt 


ttctgactct 


ggtctggatt 


acatgtgtcg 


60 


gtatgtggca 


cttgtcccct 


tttgcccttg 


gtatatgcga 


cccttctatt 


gtattgcgcc 


120 


ctttgtgccc 


gcactttccg 


gtccatgtcg 


gggatgatgg 


atcccccttc 


ccctttgctc 


180 


agttgcctcc 


tggggccagg 


ggcccctcac 


cacaaggggt 


ctggatatac 


agtttcataa 


240 


ggcctggccc 


tcccatgttt 


gcatgcctat 


gtacttctac 


gccaaacgtg 


tcagcccttc 


300 


ctcctgaagc 


tctctgccgt 


gcctccctgt 


tctgggagga cggtggtgcg gtgtgactga 


360 


atttgggcct 


cttgtacagt 


taactctccc 


aggtggattt 


tgtggaggtg 


agtaaaaggg 


420 


gcattgagac 


tataaagcag 


tagacaatcc 


ccacatacca 


tctgtagagt 


tggaactgca 


480 


ttctttcaaa 


gtwtkrrtaw 


gcatataytt 


gtagggctgt 


agacttacgg 


tcctawtakt 


540 


ctkwttrcca 


tygcttattc 


twgagcacwa 


aagtgataat 


caattakyac asttsawaca 


600 


tcacctttwt 


tgacttgttc 


caagcccttt 


tacagcccww 


tggcattttc 


ctcgcymagg 


660 


cctgtgaggt 


aactgggatc 


gcacctttta 


taccagagac 


ctgaggcaga 


tgaaatwtat 


720 


ttccatctag 


gactagaaaa 


acttgggtct 


cttaccgcga 


gactgagagg 


cagaagtcag 


780 


cccgaatgcc 


tgtcagtttc 


atggagggga 


aacgcaaaac 


ctgcagttcc 


tgagtacctt 


840 


mtacaggcmc 


ggcccagcyt 


aggcccgggg 


tggcaacacc 


acagcaagcc 


ggccccccct 


900 


cttttggcct 


tgtggataag 


ggagagttga 


ccgttttcat 


cctgcctccg 


ttattgtaac 


960 


ttggaggttc 


acacgggtgc 


gtt 








983 


<210> 9 

<211> 676 

<212> DNA 

<213> Homo sapien 












<220> 

<221> misc_£eature 
<222> (6767- . (676) 
<223> n=a, c, g or t 












<400> 9 

caaagacttc gactcccctg 


agaacggtgc ggacagcttc 


cagagctcag 


actccctcct 


60 


tcagtcctgg 


aacagccagt 


cgtccttgct 


ggatgtgcaa 


cgggttcctt 


ccttcgagag 


120 
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cttsgragrt 


gactgcakcc 


wkastccmtk 


9 

cctcaataag 


ccaacatgtm 


tttcaaggat 


180 


tacatccaag 


agaggagtga 


cccrktggag 


caaggcaaac 


cagttatacc 


tgcagctgtg 


240 


ctggccggct 


tcacaggaag 


tggacctatt 


cagctgtggc 


agtwtctcct 


ggagctgyta 


300 


tcasacaaac 


cctgccagtc 


attcatcasc 


tggactgkag 


ackgatggsa 


gtttaagctc 


360 


gccgaccccg 


atgaggtggc 


ccgccagtgg 


ggaaagaggc 


ttagcccacg 


atgaactacg 


420 


agaagctgaa 


ccggggctta 


cgctactatt 


acgactagaa 


catcatgcac 


aagacgtcgg 


480 


agaaagcgcc 


acgtgttctc 


gattaaatgt 


gcggacctca 


gaaacttgct 


tggggtttca 


540 


acgccccgag 


ggaactgtaa 


cgcgcataac 


ctgggcagta 


acaaccccgt 


acaccggaag 


600 


gtacctgacg 


ttccgcccgg 


ggacccaccc 


cttgaacccg 


gctccccaag 


gcttcgcggt 


660 


acctgttggg 


tgcccn 










676 


<210> 10 
<211> 2220 
<212> DNA 
<213> Homo sapien 












<400> 10 

gggggggaga 


agaggggagg 


gcatcatggg 


gccagtgccg 


tctgttcggc 


cctgggaagc 


60 


tgcgctgggc 


tggcttgcct 


cctgtgtggc 


tgtgccaggg 


ccatccagga 


gtgttgcact 


120 


tgggacctgg 


aggatgggaa 


ggccgagagg 


cctttggact 


gccaaaccat 


ctggaagtca 


180 


gccttttaca 


gacttcggca 


ggttccggct 


ctccaggtgt 


gatgggttca 


ggatggctga 


240 


acctggagat 


agtctggagt 


ctctttgaag 


gacctgcttg 


gctcctgctg 


caaaggaact 


300 


gcaggcattt 


gtccttccct 


tccctccccc 


accccactgc 


tgaaaaaggt 


tggcgggggg 


360 


aaagcagctc 


tgcttttcac 


agtgtatacg 


tgtcaggaga 


cagcagaggt 


gctgggttga 


420 


agattgcagg 


tggcaggccc 


agcccaggct 


gctgctcagt 


gggtgcctgg 


ccttcatcat 


480 


ctcggcccac 


ctgcttcttg 


tggtgtggcc 


agtcccagct 


gccgccttaa 


agcagatgtg 


540 


cgcaacagtg 


tacaggcaca 


cctgccacag 


caaattccta 


ggatacctca 


gtcctccatg 


600 


ttggtatgct 


tttcttttca 


acaatgtgga 


aaacgcagtt 


taccaagaag 


cttttttggt 


660 


tttcctttta 


aggtattact 


tcaaaaaagc 


aagcgatgag 


tttgcctgtg 


gagcggtgtt 


720 


tgaggagatc 


tgggaggatg 


agacggtgct 


cccgatgtat 


gaaggccgga 


. ttctgggcaa 


780 


agtggagcgg 


atcgattgag 


ccctggggtc 


tggctttggt 


gaactgttgg 


agcccgaagc 


840 


tcttgtgaac 


tgtcttggct 


gtgagcaact 


gcgacaaaac 


attttgaagg 


aaaattaaac 


900 


caatgaagaa 


gacaaagtct 


aaggaagaat 


cggccagtgg 


gccttcggga 


gggcgggggg 


960 


aggttgattt 


tcatgattca 


tgagctgggt 


actgactgag 


ataagaaaag 


cctgaactat 


1020 



wo 2004/050860 



PCT/US2003/040063 









10 








ttactaaaaa 


catgaccact 


cttggctact 


gaagatgctg 


cctgtatttg 


agagactgcc 


1080 


atacataata 


tatgacttcc 


tagggatctg 


aaatccataa 


accaagagaa 


actgtgtata 


1140 


gcttacctga 


acaggaatcc 


ttactgatat 


ttaCagaaca 


gctgatttcc 


cccatcccca 


1200 


gtctatggat 


atgctgcttt 


aaacctggaa 


gggggagaca 


ggaagtttta 


attgttctga 


1260 


ctaaacttag 


gagttgagct 


aggagtgcgt 


tcatggtttc 


ttcactaaca 


gaggaattat 


1320 


gctttgcact 


acgtccctcc 


aagcgaagac 


agactgtttt 


agacagactt 


tttaaaacgg 


1380 


tgccctacca 


ttgacacatg 


cagaaatcgg 


tgcgttttgt 


tttttttttt 


tcctatgctg 


1440 


ctccgttttg 


tcttaaaggt 


cttgagggtt 


gaccatgttg 


cgtcatcatc 


aacattttgg 


1500 


gggttgtgtt 


ggatgggatg 


atctgttgca 


gagggagagg 


cagggaaccc 


tgctccttcg 


1560 


ggccccaggt 


tgatcctgtg 


actgaggctc 


cccctcacgt 


agcctcccca 


ggcccagggc 


1620 


cctgaggcct 


gctagaatca 


ctgccgctgt 


gctttcgtgg 


aaatgacagt 


tccttgtttt 


1680 


ttttgtttct 


gtttttgttt 


tacattagtc 


attggaccac 


agccattcag 


gaactacccc 


1740 


ctgccccaca 


aagaaatgaa 


cagttgtagg 


gagacccagc 


agcacctttc 


ctccacacac 


1800 


cttcattttg 


aagttcgggt 


ttttgtgtta 


agttaatctg 


tacattctgt 


ttgccattgt 


1860 


tacttgtact 


atacatctgt 


atatagtgta 


cggcaaaaga 


gtattaatcc 


actatctcta 


1920 


gtgcttgact 


ttaaatcagt 


acagtacctg 


tacctgcacg 


gtcacccgct 


ccgtgtgtcg 


1980 


ccctatattg 


agggctcaag 


ctttcccttg 


ttttttgaaa 


ggggtttatg 


tataaatata 


2040 


ttttatgcct 


ttttattaca 


agtcttgtac 


tcaatgactt 


ttgtcatgac 


attttgttct 


2100 


acttatactg 


taaattatgc 


attataaaga 


gttcatttaa 


ggaaaatcac 


ttggtacaat 


2160 


aattattgta 


attaagagat 


gtagccttta 


ttaaaatttt 


atatttttca 


aaaaaaaaaa 


2220 


<210> 11 
<211> 1407 
<212> DNA 
<213> Homo sapien 












<220> 

<221> misc feature 
<222> (1175) . . (1175) 
<223> n=a. c, g or t 












<400> 11 

tcgacccaks gcgatcwcgg 


aggagaagca 


ggagctgtcg 


ggaagatcag 


aagccagtca 


60 


tggatgacca 


gcgcgacctt 


atctccaaca 


atgagcaact 


gcccatgctg 


ggccggcgcc 


120 


ctggggcccc 


ggagagcaag 


tgcagccgcg 


gagccctgta 


cacaggcttt 


tccatcctgg 


180 


tgactctgct 


cctcgctggc 


caggccacca 


ccgcctactt 


cctgtaccag 


cagcagggcc 


240 
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accccccaga 






W4 W ^ 9^ Vi^ w» ^ Nij fc* 


h«V^ WW WOW 






uy en wciy oy y CI 


v,.v,«auyLyav.>y 


wo WW w www 




aaac t tccca 

>5 www W 


gagaacctga 


cL^«>yycici^vj ^ 


r*!" t" t*oaaaan 


Lggacgcacc 


wVrUtryyoywo 


aaaai^^f^a 


^OW^W W WWW-W 


u^eiy^VvOivvCiu 


w<w>w>uyywuyw 


wwoiwww*3y*jw 


aow c en i> w wy w 


L»wv»a^ ^^y 


^^'^^yyyy**^ 


wy y V'O^-y y wy 


W W w W WWM^X^U 


ccagaagccg 




ccatct tcta 

Vpr W ^ WVvV> VrWS'^ 


yy wwyyy w 


y uy ay ay woy 


<».oy ayy wy y i- 


ct tcaacatc 






w w w w w w w^ w w 


Lyy L>y^^ ^yy 




d W W W W W ^ ^ ^ 


aauywaywaa 


yy^'-^^y^^-y 


cccaa tctcc 

W W W 09 W W W w w 


aagtggccaa 


aagcctagac 


agataccccg 


acaaggctcc 


aaagkaccta 


gggctcatgg 


taaccctaca 


ccckagaccc 


caggctggac 


ggccttggct 


tttctagcct 


atttacmtgc 


gccatcaccc 


cccgaggaag 


agccaat 



<210> 12 

<211> 673 

<212> DNA 

<213> Homo sapien 

<400> 12 

ggtcgcactg gtgatgcggt ctgttgtccc 

ccccagcgct ggtttcgact tcagcttcct 

tggtggccgc tactaccggg ctgatgatgc 

ggacaccacc ctcaagagcc tgagccagca 

ccgcaagaac cccgcccgca cctgccgtga 

tggcgagtag tggattgatc ccaaactaag 

catgcgatca atggagactg gtgagacgct 
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acctgcagcc 


ggagaacctg 


cgcatgaagc 


300 


tgcgcatggc 


caccccgctg 


ctgatgcagg 


360 


ccatgcagaa 


tgccaccaag 


cacggcaaca 


420 


agaatgctga 


ccccctgaag 


gtgtacccgc 


4 80 


gacaccttaa 


gaacaccatg 


gagaccacag 


540 


attggctcct 


gtttgaaatg 


agcaggcact 


600 


cgaaagtact 


gaccaagtgc 


caggaagagg 


660 


tcattcaggc 


ccaagtgcga 


cgagaacggc 


720 


catcggctac 


cgctggcgtg 


tcttccccaa 


780 


cgggcaccat 


aactgcagtg 


agtcactgga 


840 


gaccaagcag 


gatctgggcc 


cagtccccat 


900 


ctgccagccc 


cacacagcta 


cagctttctt 


960 


tcccaccctg 


tacctcatcc 


catgagaccc 


1020 


aagacaaacc 


aagtcggaac 


agcagataac 


1080 


atctgtcaac 


aggggcgtga 


ggtcccaggd 


1140 


gttcnctgac 


avtacacagc 


agcctccaac 


1200 


acgagkatgg 


gaaggcacag 


ggagaaggga 


1260 


atgctgactg 


tcctctcccc 


tccagccttt 


1320 


aggctgagcc 


actcwyrttc 


cctttcccca 


1380 








1407 


ccggccccct 


ggactcccgg 


tcccskggtc 


60 


gccccagcca 


cctcaagaga 


aggctcacga 


120 


caatgtggtt 


cgtgaccgtg 


acctcgaggt 


180 


gatcgagaac 


atccggagcc 


cagagggcag 


240 


cctcaagatg 


tgccactctg 


actggatgag 


300 


gctgcaggct 


ggatgccatc 


ctattcgttg 


360 


gcgtgtaccc 


cactcagcgc 


cagtgtggcg 


420 
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ccagaagaac 


tiggtacatca 


12 

gcaagaaccc caaggacaag aggcatgtct 




480 


gagcatgacc 


gatggattcc 


agttcgagta 


tggcggccag ggctccgacc 


ctgccgatgt 


540 


ggccatccag 


ctgaccttcc 


tgcgcctgat gtccascgag gccttccaga 


acatcaccta 


600 


ccactgtaag 


aacagtgtgg 


cctacatgga 


ccaccagact ggcaacctta 


agaaggccct 


660 


gctcctccag 


ggc 










<210> 13 

<211> 382 

<212> DNA 

<213> Homo sapien 










<400> 13 
agcagcatgg 


cacttaacag 


agagttctct 


ttcattgtga tcactaccgt 


gacac^tiac^ 


D U 


ttgtgcctat 


caggactttt 


tgcaatattg 


cgctctgtcg gctttccaat 


ct t caggga t 


1 *> n 


atcatcgagc 


tagaccaccc 


cctactatgg 


atctattttt cttcctttca 


aacacagtaa 


180 


ggaaacaatc 


tattactttt 


ttccttaaaa 


ggagaattta tagcactgta 


atacagctww 


240 


aaaatatctt 


tagaatgatg 


taaatagtta 


accttcagta gtctattaag 


gcattaatac 


300 


ttctctgaca 


tgcgcgtttg 


agggtggagg ggtcctgaag gtgcttcatc gtctgtgatt 


360 


actgcttggg 


atgtgttctt 


tg 






J O A 


<210> 14 

<211> 911 

<212> DNA 

<213> Homo sapien 










<220> 

<221> misc_f eature 
<222> (911) . . (911) 
<223> n=a, c, g or t 










<400> 14 
agc9gaacgg 


agagaacagg 


aaagcgcgag 


gagccgvcgc caccaccagc 


gcagcagtcc 


O SJ 


tggagctgtg 


aggagatccg 


ggccgtcacc 


ctgcctcccc tgcgtcccgc 


caccggccgc 


1 00 


ttctgtcctc 


ggacccattc 


caacaatctc gtaaaacatg gtggattact 


atgaagttct 




aggcgtgcag 


agacatgcgc 


tctacccgag 


ggatattaca aaaggcatat 


cgggaaactg 




gcactgaagt 


ggcatccaga 


taaaaatcct 


gagaataaag aagaagcagc 


agcagaaaat 




tcaagcaagt 


agcggaggca 


tatgaagtgc 


tgtcggatgc taagaaacgg 


gacatctatg 


360 


acaaatatgg 


caamagaagg 


attaaatggt 


ggaggacgga ggtggaagtc 


attttgacag 


420 


tccatctgaa 


tttggcttca 


cattccgtaa 


cccagatgat gccttcaggg 


aattttttag 


480 


gtggaaggga 


cccattttca 


tttgacttct 


ttgaagaccc ttttgaggac 


ttctttggga 


540 
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atcgaagggg 


tccccgagga agcagaagcc gagggacggg gtcgtcttak 


tctgcgtcca 


600 


gtggatttcc 


gtcttttgta 


agtggatggt 


cttccacgga 


tgcaggattt 


acttcattgg 


660 


ggtcactggg 


tcacggggtc 


ctcactctat 


tctcttccac 


gtcatttggt 


ggtagtggca 


720 


tgggcaacta 


taaatcgaca 


tcaacttcca 


ctaaactggt 


taatggcaga 


ccaatcacta 


780 


caaagagaat 


tgttgataac 


agtcaagaca 


gagtacaagt 


tgaagatgat 


ggccagttaa 


840 


agttcttaac 


tattggttat 


gagcagctgc 


tgtgcttgga 


taacaagtga 


ttcaacgcac 


900 


gcgcttagct 


n 










911 



<210> 15 

<211> 431 

<212> DNA 

<213> Homo sapien 

<400> 15 

ttaagakcgc kacgggcgct ttcctttcag cggagcgcgg cggcaagatg gcagtgcaaa 60 

tatccaagaa gaggaagttt gccgctgatg gcatcttcaa agctgaactg aatgagtttc 120 

ttactcggga gctggctgaa gatggctact ctggagttga ggtgcgagtt acaccaacca 180 

ggacagaaat cattatctta gccaccagaa cacagaatgt tcttggtgag aagggccggc 240 

ggattcggga actgactgct gtagtccaga agaggtttgg ctttccagag ggcagtgtag 300 

agctttatgc tgaaaaggtg gccactagag gtctgtgtgc catttsccca gcagagyyty 360 

tgcsgtacma actcyaggas ggctcgctgc gccgcgtctt tcctatcgct gtcccgcacg 420 

cccatggggc c 431 



<210> 16 

<211> 1047 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (7).. (7) 

<223> n=a, c, g or t 



<220> 

<221> misc_feature 

<222> (64)7. (64) 

<223> n^a, c, g or t 



<220> 

<221> misc_f eature 

<222> (66).. (66) 

<223> n^a, c, g or t 
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<220> 

<221> naisc_f eature 

<222> (71).. (71) 

<223> n=a, c, g or t 



<220> 

<221> misc_feature 

<222> (73) . . (73) 

<223> ns=a, c, g or t 



<220> 

<221> misc_f eature 

<222> (80) . . (80) 

<223> n=a, c, g or t 



<220> 

<221> misc_feature 

<222> (95).. (95) 

<223> n=a, c, g or t 



<220> 

<221> misc_f eature 

<222> (110) . . (110) 

<223> n=a, c, g or t 



<220> 

<221> misc_feature 

<222> (136) . . (136) 

<223> n=a, c, g or t 



<400> 16 

cccatgncct gcagaaacac ctggctgggc tgggcctgac tgaggccatt gacaagaaca 60 

aggntntttt ntnacgcatn tcaggcaaga aggantttta cctggcccan tttccacgcc 120 

accgcctttg agttgnacac agatggcaac ccctttgacc aggacatcta cgggcgcgag 180 

ggaggcgcag ccccaagctg ttctacgccg accacccctt catcttccta gtgcgggaca 240 

cccaaagcgg ctccctgcta ttcattgggc gcctggtccg gcctaagggt gacaagatgc 3 00 

gagacgagtt atagggcctc agggtgcaca caggatggca ggaggcatcc aaaggctcct 360 

gagacacatg ggtgctattg gggttggggg ggaggtgagg taccagcctt ggatactcca 420 

t^9999tg9gg gtggaaaaac agaccggggt tcccgtgtgc ctgagcggac cttcccagct 480 

agaattcact ccacttggac atgggcccca gataccatga tgctgagccc ggaaactcca 540 

catcctgtgg gacccgggcc atagtcattc tgcctgccct gaaagtccca gatcaagcct 600 

gcctcaatca gtattcatat ttatagccag gtaccttctc acctgtgaga ccaaattgag 660 

ctaggggggt cagccagccc tcttctgaca ctaaaacacc tcagctgcct ccccagctct 720 
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atcccaacct 


ctcccaacta 


taaaactagg 


tgctgcagcc 


cccgggacca 


ggcaccccca 


780 


gaatgacctg 


gccgcagtga 


ggcggattga 


gaaggagctc 


ccaggagggg 


cttctgggma 


840 


gactccggtc 


aagaagcatc 


gtgtctggcg 


ttgtggggat 


gaactttttg 


ttttgtttct 


900 


tcctttttta 


gttcttcaaa 


gatagggagg 


gaagggggaa 


catgagcctt 


tgttgctatc 


960 


aatccaagaa 


cttatttgta 


catttttttt 


tcaataaaac 


ttttcccart 


gsaaaraaac 


1020 


ccaaaaaaac 


cgagactagt 


tctctcc 








1047 


<210> 17 

<211> 833 

<212> DNA 

<213> Homo sapien 












<400> 17 
cccgaugccg 


oca ego. I. u t c 


ccgctcggcc 


gtgggtggtg 


aagcttgtag 


cctcgctcca 


60 


tgaggatctt 


tcatgaggta 


tcggtcaggt 


cccggcccag 


cccggtccca 


acgcccggat 


120 


ggctggggga 


gggcgtagcc 


ctcgtagatg 


ggccccctgt 


gggtgacccc 


ctctcccgag 


180 


tccctgacca 


tgccgcgttc 


gcaccaaaac 


cgtgaagaag 


gcggcccggg 


tcatcataga 


240 


aaagtactac 


acgcgcctgg 


gcaacgactt 


ccacacgaac 


aagcgcgtgt 


gcgaggagat 


300 


cgccattatc 


cccagcaaaa 


agctccgcaa 


caagatagca 


ggttacgtca 


cgcatctgat 


360 


gaagcgaatt 


cagagaggcc 


cagtaagagg 


tatctccatc 


aagctgcagg 


aggaggagag 


420 


agaaaggaga 


gacaattatg 


ttcctgaggt 


ctcagccttg 


gatcaggaga 


ttattgaagt 


480 


agatcctgac 


actaaggaaa 


tgctgaagct 


tttggacttc 


ggcagtctgt 


ccaaccttca 


540 


ggtcactcag 


cctacagttg 


ggatgaattt 


caaaacgcct 


cggggacctg 


tttgaatttt 


600 


ttctgtagtg 


ctgtattatt 


ttcaataaat 


tctgggacca 


ccagccttag 


aaacacaaga 


660 


aagagaaact 


gggaggccta 


tattgcgggg 


gcgggaaaga 


ggggttggag 


aagatgggcc 


720 


taaccggtgg 


tgtatcctgg 


ttgtcgctga 


cgcagaggtt 


tgctgtgtac 


tagatggggc 


780 


agaatctggc 


cgggtcccta 




ctgcatgaat 


tacaataaac 


gag 


833 


<210> 18 
<211> 1106 
<212> DNA 
<213> Homo sapien 












<400> 18 
cagtaagttt 


ggcatgggtg 


cagagggagg 


ggtccagtgc 


accttttgag 


tccttacctc 


60 


gtgggtagtt 


gttgccttgt 


gactgcccat 


^^gggc^att 


gaatagcaca 


ttggtggcta 


120 


tacgttgttg 


cacagtgctc 


aagtgcatag 


cgccctgccg 


gttgttcgca 


aggcaggagc 


180 



wo 2004/050860 

aacccctttt taggcaacgg gggtctctaa 
ggtgcgcatg tcagcagcac ggagcctccc 
atgtctgggc aatgccaggg tcctgggaac 
cagcgcgggc cgggggcttc tgggagccaa 
caacatgggc cgcgttcgca ccaaaaccgt 
gtactacacg cgcctgggca acgacttcca 
cattatcccc agcaaaaagc tccgcaacaa 
gcgaattcag agaggcccag taagaggtat 
aaggagagac aattatgttc ctgaggtctc 
tcctgacact aaggaaatgc tgaagctttt 
cactcagcct acagttggga tgaatttcaa 
tgtagtgctg tattattttc aataaatstg 
agcaataaat gcaactgcaa ataatgcctg 
ccggcaaaac caaaacaaat agaaacaccg 
caacccgcga tgaagacaaa acaaacgcgc 
ggatactcca caccaacccg gcaaat 

<210> 19 

<211> 744 

<212> DNA 

<213> Homo sapien 

<400> 19 

taacaggctt cgcctgtgct tcctgtttcc 
gcgttcgcac caaaaccgtg aagaaggcgg 
gcctgggcaa cgacttccac acgaacaagc 
gcaaaaagct ccgcaacaag atagcaggtt 
gaggcccagt aagaggtatc tccatcaagc 
attatgttcc tgaggtctca gccttggatc 
aggaaatgct gaagcttttg gacttcggca 
actgtaggct gagtgacctg aaggttggac 
tcaaaacgcc tcggggacct gtttgaattt 
ttctgggacc accagcctta gaaacacaag 
ggcgggaaag aggggttgga gaagatgggc 
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tgcccgagca ctgtgggctt ggtcacagga 24 0 

cgggcaggat gacttttgag ggggacacag 300 

agaggccccg agcaggacca ggagtgcggg 360 

aggcgaggct gaggttgcaa actctggggc 420 

gaagaaggcg gcccgggtca tcatagaaaa 480 

cacgaacaag cgcgtgtgcg aggagatcgc 54 0 

gatagcaggt tacgtcacgc atccgacgaa 600 

ctccatcaag ctgcaggagg aggagagaga 660 

agccttggat caggagatta ttgaagtaga 720 

ggacttcggc agtctgtcca accttcaggt 780 

aacgcctcgg ggacctgttt gaattttttc 840 

ggacmacagc ataaaatata aaagacagag- 900 

ctccaagcac gacacaaaaa acagaaccat 960 

acatacacag caaacaagga catgaaccag 1020 

gacgcaaaaa gaacaaacca taccaccgaa 1080 

1106 



tcttttacca aggacccgcc aacatgggcc 60 

cccgggtcat catagaaaag tactacacgc 120 

gcgtgtgcga ggagatcgcc attatcccca 180 

acgtcacgca tctgatgaag cgaattcaga 24 0 

tgcaggagga ggagagagaa aggagagaca 3 00 

aggagattat tgaagcagat cctgacacta 360 

gtctgtccaa ccttcaggtc attcatccca 420 

agactgctca gcctacagtt gggatgaatt 480 

tttctgtagt gctgtattat tttcaataaa 540 

aaagagaaac tgggaggccc atattgcggg 600 

ctaaccggtg gtgtatcctg gttgtcgctg 660 
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acgcagaggt 


ttgctgtgta 


ctagatgggg 


17 

cagaatctgg 


ccgggtccct 


atagtggagg 


720 


tctgcatgaa 


ctacaataaa 


cgag 








744 


<210> 20 
<211> 1559 
<212> DNA 
<213> Homo sapien 












<400> 20 


qcqqcatkqq 

'^'^S i3 


cagtatcygc 


ckscatcctc 


ttccgtgagg 


cgcgctgaga 


60 


ccctQQacycr 


gcctcctgar 


aggwtgccgg 


tgcgggcgcc 


cgcggagagg 


gacccgtcgc 


120 


CatQQQCCQt 


qtqatccqtq 


gacagaggaa 


gggcgccggg 


tctgtgttcc 


gcgcgcacgt 


180 


gaagcaccgt 


aaaggcgctg 


cgcgcctgcg 


cgccgtggat 


tccgctgagc 


ggcacggcta 


240 


caCcaagggc 


atcgtcaagg 


acatcatcca 


cgacccgggc 


cgcggcgcgc 


ccctcgccaa 


300 




cqqqatccqt 


atcggtttaa 


gaagcggacg 


gagctgctca 


ttgccgccga 


360 


gggcattcac 


acqqqccagt 


ttgtgtattg 


cggcaagaag 


gcccagctca 


acattggcaa 


420 


tgtgctccct 


gtgggcacca 


tgcctgaggg 


tacaatcgtg 


tgctgcctgg 


aggagaagcc 


480 


tqgaqaccQt 


ggcaagctgg 


cccgggcatc 


agggaactat 


gccaccgtta 


tctcccacaa 


540 


ccctgagacc 


aagaagaccc 


gtgtgaagct 


gccctccggc 


tccaagaagg 


ttatctcctc 


600 


agccaacaga 


qctqtqqt tq 


gtgtggtggc 


tggaggtggc 


cgaattgaca 


aacccatctt 


660 


QaaQQCtQQC 


cqqqcqtacc 


acaaatataa 


ggcaaagagg 


aactgctggc 


cacgagtacg 


720 


qqqtQtQQCC 


atgaatcctg 


tggagcatcc 


ttttggaggt 


ggcaaccacc 


agcacatcgg 


780 


caagccctcc 


accatccgca 


gagatgcccc 


tgctggccgc 


aaagtgggtc 


tcattgctgc 


840 


ccgccggact 


ggacgtctcc 


ggggaaccaa 


gactgtgcag 


gagaaagaga 


actagtgctg 


900 


^99gcct:caa 


taaagtttgt 


gtttatgcca 


aaaaaaaaaa 


aaaaaaaaaa 


gaaaacaaaa 


960 


aaaaaaaaaa 


aaaaaaaggc 


agaaaaaaaa 


aaagaaaaac 


caaagcaaaa 


ggaagaaaaa 


1020 


agaaaaaaag 


aaaaagaaga 


agaagaggca 


gagaggagaa 


caagagggcg 


caccacacac 


1080 


agccagcggg 


gccaaacacc 


accccacggc 


gacaacaacc 


gacagagaga 


gagaccaacc 


1140 


ctcacccagg 


cggagaggga 


ggggcgcgcg 


gagaagaaaa 


aaccacaaaa 


aacagccgaa 


1200 


cacgcgcaac 


gaagcgacca 


cacacaaagc 


agaagaaaac 


accgaaaagc 


ctaagaagga 


1260 


agaggcacgc 


caccccgcca 


ccgggggcga 


caacggcgca 


ccgaccacgc 


acgacagagc 


1320 


atagcaccaa 


gcagcagaag. 


gcggagcaac 


catagacagc 


caccagaccc 


acagccgagg 


1380 


aacacgacag 


acgcaccacc 


cccaggcgcg 


ccgaagcata 


gacgaacaca 


cgagataaac 


1440 


accacagtga 


cggcggcgca 


gaggaagcac 


acgacacatc 


caacgaagga 


aagaacgaaa 


1500 
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18 








cacaaagcaa 


aacaggccac 


cgccagaaac 


aatcagtcac 


gccagcccca 


cacaccagc 


1559 


<210> 21 
<211> 1745 
<212> DNA 
<213> Homo sapien 












<400> 21 
gcggccggcc 


gggcaggtac 


ccagaagaga 


ggcgggtcaa 


agaagtagtg 


agataacgca 


60 


tatctcaggt 


cataggctat 


cccatcaccc 


tttatttggc 


gaaggrcaga 


gaaggaaatt 


120 


agtgatgatg 


mwggcagagg 


aagagaaagg 


tgcagcaaca 


rcacagcagg 


caagcataac 


180 


agcatgcatg 


aagcaaaagc 


ccaagatcgm 


magatgtggg 


ttcagatgca 


ggcaggcatg 


240 


acagcggtaa 


ggcataagca 


agaagcaaaa 


ctcaagcaag 


catcaaagca 


gcaaatacat 


300 


tgatccaggc 


aagcaactaa 


acaagaccaa 


gcctatttgg 


accagaaacc 


ctgatgacat 


360 


cacccaagag 


gagtatggag 


aattctacaa 


gagcctcact 


aatgactggg 


aagaccactt 


420 


ggcagtcaag 


cacttttctg 


tagaaggtca 


cgttggaatt 


caggscattg 


ctatttattc 


480 


ctcgtcgggc 


tccctttgac 


ctttgtgaga 


acaagaagaa 


aaagaacaac 


atcaaactct 


540 


atgtccgccg 


tgtgttcatc 


atggacagcc 


gtgatgagtt 


gataccagag 


tatctcaatt 


600 


ttatccgtgg 


tgtggttgac 


tctgaggatc 


tgcccctgaa 


catctcccga 


gaaatgctcc 


660 


agcagagcaa 


aatcttgaaa 


gtccattcgc 


aaaaacattg 


ttaagaagtg 


ccttgagctc 


720 


ttctctagag 


ctggcagaag 


acaaggagaa 


ttacaagaca 


attcctatga 


gggcacttct 


780 


cataaaaatc 


tcaacgcttg 


gaatccacga 


agacatccac 


taaccggcgc 


cgcctgtctg 


840 


agagctgctg 


cgctatcata 


cctcccagtc 


tggagatgag 


atgacatctc 


tgtcagagta 


900 


tgtttctcgc 


atgaaggaga 


cacagaagtc 


catctattac 


atcactggtg 


agagcaaaga 


960 


gcaggtggcc 


aactcagctt 


ttgtggagcg 


agtgcggaaa 


cggggcttcg 


aggtggtata 


1020 


caugaccgag 


CCCautgaCg 


agtactgtgt 


gcagcagctc 


aaggaatttg 


atgggaagag 


1080 


cctggtctca 


gttaccaagg 


agggtctgga 


gctgcctgag 


gatgaggagg 


agaagaagaa 


1140 


gatggaagag 


agcaaggcaa 


agtttgagaa 


cctctgcaag 


ctcatgaaag 


aaatcttaga 


1200 


taagaaggtt 


gagaaggtga 


caatctccaa 


tagacttgtg 


tcttcacctt 


gctgcattgt 


1260 


gaccagcacc 


tacggctgga 


cagccaatat 


ggagcggatc 


atgaaagccc 


aggcacctcg 


1320 


ggacaactcc 


accatgggct 


atatgatggc 


caaaaagcac 


ctggagatca 


accctgacca 


1380 


ccsccattgt 


ggagacgctg 


cggcagaagg 


ctgaggccga 


cgagaatgat 


aaggcagtta 


1440 


aggacctggt 


ggtgctgctg 


tttgaaaccg 


ccctggtatc 


ttctggcttt 


tcccttgagg 


1500 


agtccccaga 


cccactccaa 


ccgcgatcta 


tcgcatgatc 


aagctaggtc 


taggtattga 


1560 
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tgaagatgaa gtggcagcag aggaacccaa tgctgcagtt cctgatgaga tcccccctct 1620 
cgagggcgat gaggatgcgt ctcgcatgcg aggaagtcga gttaggttag gagtcgtact 1680 
tggaaacact tgtgcttttg gtttctgtgt cccccatggg gctcccactg cgcctcgagt 174 0 
gcccc 



1745 



<210> 22 

<211> 379 

<212> DNA 

<213> Homo sapien 

<400> 22 

gtcgccaggt ccgcaggtcc gcctggccga gcaggaggcg ccatcatggg agtggacatc 60 

cgccataaca aggaccgaaa ggttcggcgc aaggagccca agagccagga tatctacctg 120 

aggctgttgg tcaagttata caggtttctg gccagaagaa ccaactccac attcaaccag 180 

gttgtgttga agaggttgtt tatgagtcgc accaaccggc cgcctctgtc cctttcccgg 240 

atgatccgga agatgaagct tcctggccgg gaaaacaaga cggccgtggt tgtggggacc 300 

ataactgatg atgtgcgggt tcaggaggta cctcgccgcg accacgcatc catcacactg 360 

cgccgctcga catgcatct 379 

<210> 23 
<211> 1577 

<212> DNA 

<213> Homo sapien 

<400> 23 

ttcccgcggg aatctggacc gcgaccggtt ccacggacgg actccggggc gtccgtggga 60 

gctgggtgcc tgaggacctt ggcggtcggg ccggggcagg agggtgcagg tggtcgtgat 120 

tcgggatgca cagtaatctg gcggtcagct gcgggcccga ctgggatccg gggcttcggg 180 

ggcgcgcgcc gtccggggcc cgagctggga tcttgctgcg ccgctcacgt actcacctct 24 0 

gctagtgatg tctggagcta cggaattgtc atgtgggagg tcatgagcta tggagagcga 300 

ccctactggg acatgagcaa ccaggatgtc atcaatgccg tggagcagga ttaccggctg 360 

ccaccaccca tggactgtcc cacagcactg caccagctca tgctggactg ctgggtgcgg 420 

gaccggaacc tcaggcccaa attctcccag attgtcaata ccctggacaa gctcatccgc 4 80 

aatgctgcca gcctcaaggt cattgccagc gctcagtctg gcatgtcaca gcccctcctg 54 0 

gaccgcacgg tcccagatta cacaaccttc acgacagttg gtgattggct ggatgccatc 600 

aagatggggc ggtacaagga gagcttcgtc agtgcggggt ttgcatcttt tgacctggtg 660 

gcccagatga cggcagaaga cctgctccgt attggggtca ccctggccgg ccaccagaag 720 

aagatcctga gcagtatcca ggacatgcgg ctgcagatga accagacgct gcctgtgcag 780 
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gtccgacacc 


ggctcccacg 


gggaccctga 


ggactttcgg 


actcttggac 


ttttggatgc 


agtttgggaa 


aggcccaagc 


tgggacttct 


gccccaaacc 


ccttcatatt 


gaagatggat 


cccctcaggg 


cccagacctt 


cctgctctcc 


ctgttcttca 


gtgctggagg 


tcctggcagg 


ggcccagccc 


tggcaggggt 


ctggcccccc 


aactggagga 


ggggactcca 


ggaatgggga 


gcacctccag 


tttgcacagg 


gatttgttct 


ccttggtgct 


gtcataaaag 


ggcaggcagg 


agagactgac 


tctcagagcc 


agagatggga 


cgcgcgcgcg 


tgtgtgtgtg 


cacgcactgg 


aagcttggcc 


ctgtgcccta 


caatggggcc 


agatgaaaaa 


aaaaaaa 




<210> 24 
<211> 1833 

<212> DNA 

<213> Homo sapien 




<400> 24 
tgcaaggcca 


ggaagctaca 


gggaaagttc 


tccccaccgc 


gcttggcgat 


gcatccatag 


cccagcgaag 


gggagagttt 


gctcccaaag 


aggaaagaac 


tggaaccagc 


ctctgacctg 


acagagagga 


gtcctgccca 


gctcttggat 


caggggtgac 


cctggagccc 


agcctgcccc 


tgtctgtgat 


agggcccctg 


acttcctctc 


cttgggcagc 


tcagtggctc 


tgaactgcac 


cctgccttca 


gtccagtggc 


tgaaagacgg 


cctccacgag 


tactcctggg 


tcaaggccaa 


9g99gtcaac 


gtgaccagca 


ctgaagtcta 


cagcttctcc 


tccttcactc 


ttcagagagc 


ggcctccctc 


ccggtcctgc 


tggccccgct 
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ggaccgtgca 


gggatgccaa 


gcagccggct 


840 


ctggccttag 


gctgtggccc 


agaagctgga 


900 


ccaggcctgt 


gttccctccc 


caggaagtgc 


960 


taggagaggg 


ggtgatgacc 


cctccccaag 


1020 


agcaggggat 


ccccacaacc 


tcacacctgt 


1080 


gtcaggctgg 


ggtaagccgg 


ggttccacag 


1140 


aggtaggcgg 


agagcagtcc 


ctccctcagg 


1200 


aatgcgacac 


caccaccctg 


aagccagcct 


1260 


gggggccgag 


ggccctgtcc 


ccacccccgc 


1320 


ggcaggctga 


ggagttgccc 


tttgcccccc 


1380 


tgtgtgagtg 


tgtgtgtgtg 


tgtgtgcgcg 


1440 


cctgcacaga 


gagcatgggt 


gagcgtgtaa 


1500 


agctgggccg 


acagcagaat 


aaaggcaata 


1560 








1577 


tgatcaaaat 


acacaaagac 


acaagccagg 


60 


cagccttggt 


gctgtggaca 


ctccctgggg 


120 


gcgcaccaat 


gaccaacatt 


tgccccccgg 


180 


tccaggtgcc 


ctgtccagct 


gactgcaagg 


240 


cagtctgctg 


gccgaggagc 


ccggtggagc 


300 


gaggaggccc 


cggctcagag 


ccatgccagg 


360 


cccgtctgaa 


gaccaggtgc 


tgaggcctgc 


420 


ggcttgggta 


gtctctgggc 


cccactgctc 


480 


gcttccattg 


ggaattgggg 


gccactacag 


540 


cctgtcagag 


gtgcttgtgt 


ccagtgtcct 


600 


tggggcc.ttc 


acctgctcca 


tccagaacat 


660 


tggccctaca 


agccacgtgg 


ctgcggtgct 


720 


gctggccgcc 


ctgctctatg 


tcaagtgccg 


780 
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tctcaacgtg 


ctgctctggt 


accaggacgc 


gcatggggag 


gtggagataa 


acgacgggaa 


840 


gctctacgac 


gcctacgtct 


cctacagcga 


ctgccccgag 


gaccgcaagt 


tcgtgaactt 


900 


catcctaaag 


ccgcagctgg 


agcggcgtcg 


gggctacaag 


ctcttcctgg 


acgaccgcga 


960 


cctcccgccg 


cgcgctgagc 


cctccgccga 


cctcttggtg 


aacctgagcc 


gctgccgacg 


1020 


cctcatcgtg 


gtgctttcgg 


acgccttccc 


gagccgggcc 


tggtgcagcc 


acagcttccg 


1080 


gtgggtcccg 


cgcggggttg 


ggtgggcccc 


agcgtacacc 


cacccccctg 


acggtcccgc 


1140 


cccgcaggga 


gggcctgtgc 


cggctgctgg 


agctcacccg 


cagacccatc 


ttcaccacct 


1200 


tcgagggcca 


gaggcgcgac 


cccgcgcacc 


cggcgccccg 


cctgctgcgc 


cagcaccgcc 


1260 


acctggtgac 


cttgctgctc 


tggaggcccg 


gccccgtgac 


tccttcctcc 


gatttttgga 


1320 


aagaagtgca 


gctggcgctg 


ccgcggaagg 


tgcggtacag 


gccggtggaa 


ggagaccccc 


1380 


agacgcagct 


gcaggacgac 


aaggacccca 


tgctgattct 


tcgaggccga 


gtccctgagg 


1440 


gccgggccct 


ggactcagag 


gtggacccgg 


accctgaggg 


cgacctgggt 


gtccgggggc 


1500 


ctgtttttgg 


agagccatca 


gctccaccgc 


acaccagtgg 


ggtctcgctg ggagagagcc 


1560 


ggagcagcga 


agtggacgtc 


tcggatcccg 


gctcgcgaaa 


ctacagtgcc 


cgcacagact 


1620 


tctactgcct 


ggtgtccaag 


gatgatatgt 


agctcccacc 


ccagagtgca 


ggatcatagg 


1680 


gacagcgggg 


gccagggcag 


cggcgtcgct 


cctctgctca 


acaggaccac 


aacccctgcc 


1740 


agcagccctg 


ggaccctgcc 


agcagccctg 


ggaaaaggct 


gtggcctcag 


ggcgcctccc 


1800 


agtgccagaa 


aataaagccc 


t- t- t-nnst t* t* r' 
u I. U u^^a I. 


tga 






1833 


<210> 25 
<211> 2138 
<212> DNA 
<213> Homo sapien 












<400> 25 

tgcaaggcca ggaagctaca 


gggaaagttc 


tgatcaaaat 


acacaaagac 


acaagccagg 


60 


tccccaccgc 


gcttggcgat 


gcatccatag 


cagccttggt gctgtggaca ctccctgggg 


120 


cccagcgaag gggagagttt 


gctcccaaag 


gcgcaccaaC 


gaccaacatt 


tgccccccgg 


160 


aggaaagaac 


tggaaccagc 


ctctgacctg 


tccaggtgcc 


ctgtccagct gactgcaagg 


240 


acagagagga 


gtcctgccca 


gctcttggat 


cagtctgctg gccgaggagc 


ccggtggagc 


300 


caggggtgac 


cctggagccc 


agcctgcccc 


gaggaggccc 


cggctcagag 


ccatgccagg 


360 


tgtctgtgat 


agggcccctg 


acttcctctc 


cccgtctgaa 


gaccaggtgc 


tgaggcctgc 


420 


cttgggcagc 


tcagtggctc 


tgaactgcac 


ggcttgggta. 


gtctctgggc 


cccactgctc 


480 


cctgccttca 


gtccagtggc 


tgaaagacgg 


gcttccatcg ggaattgggg gccactacag 


540 



wo 2004/050860 PCT/US2003/040063 

22 



cctccacgag 


tactcctggg 


tcaaggccaa 


cctgtcagag gtgcttgtgt 


ccagtgtcct 


600 


gggggtcaac 


gtgaccagca 


ctgaagtcta 


tggggccttc 


acctgctcca 


tccagaacat 


660 


cagcttctcc 


tccttcactc 


ttcagagagc 


tggccctaca 


agccacgtgg 


ctgcggtgct 


720 


ggcctccctc 


ctggtcctgc 


tggccctgct 


gctggccgcc 


ctgctctatg 


tcaagtgccg 


780 


tctcaacgtg 


ctgctctggt 


accaggacgc 


gtatggggag gtggagataa 


acgacgggaa 


840 


gctctacgac 


gcctacgtct 


cctacagcga 


ctgccccgag gaccgcaagt 


tcgtgaactt 


900 


catcctaaag 


ccgcagctgg 


agcggcgtcg 


gggctacaag 


ctcttcctgg 


acgaccgcga 


960 


cctcctgccg 


cgcgctgagc 


cctccgccga 


cctcttggtg 


aacctgagcc 


gctgccgacg 


1020 


cctcatcgtg 


gtgctttcgg 


acgccttcct 


gagccgggcc 


tggtgcagcc 


acagcttccg 


1080 


gtgggtcccg 


cgcggggttg 


ggtgggcccc 


agcgtacacc 


cacccccctg 


acggtcccgc 


1140 


cccgcaggga 


gggcctgtgc 


cggctgctgg 


agctcacccg 


cagacccatc 


ttcatcacct 


1200 


tcgagggcca 


gaggcgcgac 


cccgcgcacc 


cggcgctccg 


cctgctgcgc 


cagcaccgcc 


1260 


acctggtgac 


cttgctgctc 


tggaggcccg 


gctccgtggt 


gcggagcagg 


cgcgggaggg 


1320 


tccggggcta 


gcggcgggtt 


agagatgggc 


ggtgcccggg 


ctccaggctg 


ggacccctcc 


1380 


gtggggagct 


ctgcggcacc 


acgctttgtg 


aatgggccct 


ggggggaggt 


tccgctgcct 


1440 


ggggccccga 


tgcggggagc 


cgcccttgag 


gcccccggag ccacggaata gctgtcgcag 


1500 


ggcgtggaac 


ccgtgggcag 


ccgcaggtgt 


gctcttgggg 


gccaggacgc 


caggggcttc 


1560 


cgaggtgttc 


acacctgcaa 


accgccccga 


cctggccccc 


aggactcctt 


cctccgattt 


1620 


ttggaaagaa 


gtgcagctgg 


cgctgccgcg 


gaaggtgcgg 


tacaggccgg 


tggaaggaga 


1680 


cccccagacg 


cagctgcagg 


acgacaagga 


ccccatgctg attcttcgag gccgagtccc 


1740 


tgagggccgg 


gccctggact 


cagaggtgga 


cccggaccct 


gagggcgacc 


tgggtgtccg 


1800 


ggggcctgtt 


tttggagagc 


catcagctcc 


accgcacacc 


agtggggtct 


cgctgggaga 


1860 


gagccggagc 


agcgaagtgg 


acgtctcgga 


tctcggctcg 


cgaaactaca 


gtgcccgcac 


1920 


agacttctac 


tgcctggtgt 


ccaaggatga 


tatgtagctc 


ccaccccaga 


gtgcaggatc 


1980 


atagggacag 


C99999ccag 


ggcagcggcg 


tcgctcctct 


gctcaacagg 


accacaaccc 


2040 


ctgccagcag 


ccctgggacc 


ctgccagcag 


ccctgggaaa 


aggctgtggc 


ctcagggcgc 


2100 


ctcccagtgc 


cagaaaataa 


agtccttttg 


gattctga 






2138 



<210> 26 

<211> 676 

<212> DNA 

<213> Homo sapien 
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<400> 26 
cccccgcccc 


tgcgccgccg 


acgcccgccc 


tcccgcagag 


ccctccgccg 


acct cttggt 


o u 


gaacctgagc 


cgctgccgac 


gcctcatcgt 


ggtgctttcg 


gacgccttcc 


tgagccgggc 




ctggtgcagc 


cacagcttcc 


gactccttcc 


tccgattttt 


ggaaagaagt 


gcagctggcg 


180 


ctgccgcgga 


aggtgcggta 


caggccggtg 


gaaggagacc 


cccagacgca 


gctgcaggac 


240 


gacaaggacc 


ccatgctgat 


tcttcgaggc 


cgagtccctg 


agggccgggc 


cctggactca 


300 


gaggtggacc 


cggaccctga 


gggcgacctg 


ggtgtccggg 


ggcctgtttt 


tggagagcca 


360 


tcagctccac 


cgcacaccag 


tggggtctcg 


ctgggagaga 


gccggagcag 


cgaagtggac 


420 


gtctcggatc 


tcggctcgcg 


aaactacagt 


gcccgcacag 


acttctactg 


cctggtgtcc 


480 


aaggatgata 


tgtagctccc 


accccagagt 


gcaggatcac 


agggacagcg 


ggggccaggg 


540 


cagcggcgtc 


gctcctctgc 


tcaacaggac 


cacaacccct 


gccagcagcc 


ctgggaccct 


600 


gccagcagcc 


ctgggaaaag 


gctgtggcct 


cagggcgcct 


cccagtgcca 


gaaaataaag 


660 


tccttttgga 


ttctga 










676 


<210> 27 
<211> 1333 
<212> DNA 
<213> Homo sapien 












<400> 27 
agatgctgtc 


gagcggcgca 


gtgtgatgga 


tcgtggtcgc 


ggcgaggaat 


ggagagcacg 


60 


gtctgaatct 


gcacagagca 


agatgctgag 


tggagtcggg 


ggctttgtgc 


tgggcctgct 


120 


cttccttggg 


gccgggctgt 


tcatctactt 


caggaatcag 


aaaggacact 


ctggactcca 


180 


gccaacagga 


ttcctgagct 


gaaatgcaga 


tgaccacact 


caaggaagaa 


ccttctgccc 


240 


cagctttgca 


ggatgaaaag 


ctttcctgct 


tggcagttat 


tcttccacaa 


gagagggctt 


300 


tctcaggacc 


tggttgctac 


tggttcggca 


actgcagaaa 


atgtcctccc 


ttgtggcttc 


360 


ctcagctcct 


gcccttggcc 


tgaagtccca 


gcattgatgg 


cagcgccCca 


tcttcaactt 


420 


ttgtgctccc 


ctttgcctaa 


accgtatggc 


ctcccgtgca 


tctgtactca 


ccctgtacga 


480 


caaacacatt 


acattattaa 


atgtttctca 


aagatggagt 


taaatatcat 


ctggtccatt 


540 


tggctccaaa 


gacagaaaat 


gaaaagaaaa 


agggaagatt 


attttcctaa 


tagaataatg 


600 


attttcatgt 


atatgtcatg 


agtgtgtgag 


gtaatgcgta 


tgttaaacag 


ctggatttag 


660 


acattccaca 


ctatagccat 


atatcaaaac 


ttcatgctgt 


acaatataaa 


tatmctatac 


720 


aatttttact 


tgtcaattaa 


aaaagtaatc 


ctaacattta 


aaaaggcaat 


gcataaaaac 


780 


tgagaacaga 


ctacaacaac 


tgaaacaaac 


tgggcaaaca 


tgagatgaga 


aaccagccag 


840 


caagtcaatc 


agaactcttt 


cttcatctcg 


tctacaatat 


tttgtattta 


taactgcaaa 


900 
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ttagtgtata gtgtttcact ccagagactt 
cagatttcag agaaagacac atttagaaga 
agtcagtatg aaatgtcaaa tctaaaagca 
atcatttgag cataatttct ccaccgtcag 
taaaagttgt ttttatgcat atcttatttt 
gtacaagtgc atataaatcc tgtaagagta 
ttatgcctgt tctgtttgat cacagagttg 
gaaatgattt tct 

<210> 28 

<211> 1228 

<212> DNA 

<213> Homo sapien 



<400> 28 



ctttgggagc 


tggctctctc 


cataagcttt 


acactctgga 


cttcagccaa 


caggattcct 


aagaacct tic 


Cgccccagc^ 


c cgcaggacg 


cacaagagag 


ggctttctca 


ggacctggtt 


ctcccttgtg 


gcttcctcag 


ctcctgccct 


cctcatcttc 


aacttttgtg 


ctcccctttg 


actcaccctg 


tacgacaaac 


acactacatt 


atcacctggt 


ccatttggct 


ccaaagacag 


cctaatagaa 


taatgatttt 


catgtatatg 


aatagctgga 


tttagacatt 


ccacactata 


ataaatatmc 


tatacaattt 


ttacttgtca 


gcaatgcata 


aaaactgaga 


acagactata 


cgagaaacca 


gccagcaagt 


caatcagaac 


atttataact 


gtaaattagt 


gtatagtgtt 


atcaaaggac 


ttgtacagat 


ttcagagaaa 


tatgttctat 


ctgagagtca 


gtatgaaatg 


tatttcaaat 


taataatcat 


ttgagcataa 


tattttcaat 


caaattaaaa 


gttgttttta 


tgtacatacg 


tagcagcaca 


agtgcatata 
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caataataca 


gcgttatcaa 


aggacttgta 


960 


tggagagttc 


tccgttatgt 


tctatctgag 


1020 


cataattcag 


aggtctatct 


caaattaata 


1080 


agacgactgt 


tattttattt 


tcaatcaaat 


1140 


tagttatatg 


ttacttgtac 


atacgtagca 


1200 


taatcctata 


gaaacacatt 


aagctgataa 


1260 


caacaaatag 


gcctcatttc 


ctaagtcgag 


1320 








1333 


tctggaggag 


gaacwrtggc 


tttgctgagg 


60 


gagctgaaat 


gcagatgacc 


acattcaagg 


120 


aaaagctttc 


ctgcttggca 


gttattcttc 


180 


gctactggtt 


cggcaactgc 


agaaaatgtc 


240 


tggcctgaag 


tcccagcatt 


gatggcagcg 


300 


cctaaaccgt 


atggcctccc 


gtgcatctgt 


360 


attaaatgtt 


tctcaaagat 


ggagttaaat 


420 


aaaatgaaaa 


gaaaaaggga 


agattacttt 


480 


tcacgagtgt 


gtgaggtaat 


gcgtatgtta 


54 0 


gccatatatc 


aaaacttcat 


gctgtacaat 


600 


attaaaaaag 


taatcctaac 


atttaaaaag 


660 


acaactgaaa 


caaactgggc 


aaacatgaga 


720 


tctttcttca 


tctcgtctac 


aatattttgt 


780 


tcactccaga 


gacttcaata 


atacagtgtt 


840 


gacacattca 


gaagacggag 


agttctccgt 


900 


tcaaatctaa 


aagtacataa 


ttcagaggtc 


960 


tttctccact 


gtcagagacg 


actgttattt 


1020 


tgcatatctt 


atttttagtt 


atatgttact 


1080 


aatcctgtaa 


gagtataatc 


ctatagaaat 


114 0 
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atattaagct 


gataattatg 


tctgtcctgt 


ttgatcacag 


agttgcaaca 


aataggcctc 


1200 


atttcctaag 


ttgaggaaat 


aattttct 

Www W w W 








1 22fl 


<210> 29 
<211> 2619 
<212> DNA 
<213> Homo sapien 












<400> 29 
ggggaacaca 


acttaggcta 


ggtacgaggc 


ctgggtggtc 


mggctatcgg 


actctccatt 


60 


gccctgcacg 


tgggttttga 


caacttcgag 


cagctgctct 


cgggggctca 


ctggatggac 


120 


cagcacttcc 


gcacgacgcc 


cctggagaag 


aacgcccccg 


tcttgctggc 


cctgctgggt 


180 


atctggtaca 


tcaactgctt 


tgggtgtgag 


acacacgcca 


tgctgcccta 


tgaccagtac 


240 


ctgcaccgct 


ttgctgcgta 


cttccagcag ggcgacatgg 


agtccaatgg 


gaaatacatc 


300 


accaaatctg 


gaacccgtgt 


ggaccaccag 


acaggcccca 


ttgtgtgggg ggagccaggg 


360 


accaatggcc 


agcatgcttt 


ttaccagctc 


atccaccaag 


gcaccaagat 


gataccctgt 


420 


gacttcccca 


tcccggtcca 


gacccagcac 


cccatacgga 


agggtctgca 


tcacaagatc 


480 


ctcctggcca 


actccttggc 


ccagacagag 


gccctgatga 


ggggaaaatc 


gacggaggag 


540 


gcccgaaagg 


agctccaggc 


tgcgggcaag agtccagagg 


accttgagag gctgctgcca 


600 


cataaggtct 


ttgaaggaaa 


tcgcccaacc 


aactctattg 


tgttcaccaa 


gctcacacca 


660 


ttcatgcttg 


gagccttggt 


cgccatgtat 


gagcacaaga 


tcttcgttca 


gggcatcatc 


720 


tgggacatca 


acagctttga 


ccagtgggga 


gtggagctgg 


gaaagcagct 


ggctaagaaa 


780 


atagagcctg 


agcttgatgg 


cagtgctcaa 


gtgacctctc 


acgacgcttc 


taccaatggg 


840 


ctcatcaact 


tcatcaagca 


gcagcgcgag gccagagtcc 


aataaactcg 


tgctcatctg 


900 


cagcctcctc 


tgtgactccc 


ctttctcttc 


tcgtccctcc 


tccccggagc 


cggcactgca 


960 


tgttcctgga 


caccacccag 


agcaccctct 


ggttgtgggc 


ttggaccacg agcccttagc 


1020 


agggaaggct 


ggtctccccc 


agcctaaccc 


ccagcccctc 


catgtctatg 


ctccctctgt 


1080 


gctagaattg 


gctgaagtgt 


ttttgtgcag ctgacttttc 


tgacccatgt 


tcacgttgtt 


1140 


cacatcccat 


gtagaaaaat 


aaagatgcca 


cggaggaggt 


tgtaggctca 


gcctctgatt 


1200 


ttttttttcc 


tgtgatggtg 


ctttatgtag 


cagagggcag 


gagcgctcag 


caggacgcag 


1260 


gctgtgcctc 


tgcggacact 


taacactaag 


tggtgagcgg 


gtctagagtg gagcaaggtg 


1320 


ccctgagaag 


acaatagtgg 


ggtgggggca caatcagtca ggacggcaca gktrcrggct 


1380 


cayrcctgta 


atcycagcac 


tttgggaggc 


craggyaggk ggaccacctg aggtcagacc 


1440 


agtctggtca 


acatggtgaa 


accccgtctg 


taccaaaaat 


acaaaaacca 


gctgggcatg 


1500 
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gtggcacctg 


cctgtggtcc 


cagctactcg 


ggaggccgag gcagaagact 


cgctggaacc 


1560 


cgggaggcgg 


agcttgccgc 


gagccgagac 






tgggtgacag 


X620 


agccagactc 


catctcaaca 


acagacgcac 






gcacagggcc 


1680 


gaggagcgca 


tgtcacacag 


gcggatggtc 






gaacacgttg 


1740 


cactggtgcg 


ggcggaactc 


ggctgcagtg 






catgttagca 


1800 


ggcttgatgt 


ccacgatgtt 


aaagcttcca 


f r>^ nf-na ^ t* !■ 

uccgugatuu 


r"t* is a ri t" /~T ^ ^ a 


taaattaatt 


1860 


ctcaggtcat 


ctgcagaaag 


atatgtttca 


cgatcaccac 


u uaCCgaoau 


ggaatttata 


1920 


tgatatgtgt 


gagcatttgc 


aaaaattcgc 


L. ^ a ^ u v.. 3 




aagacccatg 


1980 


ggcttcaata 


ttgggacccg 


tagcgccgtg 






tcttccatct 


2040 


tcgtccttca 


ggttataacc 


ttctgctctt 






ccataatttt 


2100 


atagttttat 


catttgtaga 


cagtagaaaa 






taaccaccta 


2160 


attttattaa 


ttttttcctc 


aatttctaga 




a.y ^ >w Ca c* CA ^ ^ N— 


cggttcatga 


2220. 


ctttgaaagg 


tgctgtaaac 


attatattct 






tttattctct 


2280 


tgttcacgct 


gaaaaataac 


aactctgccg 


cccttgtctc 


ctgttgcaag 


aagatctcca 


2340 


aaataat taa 


ac^caacggt 


QoraaatoratQ 


tccgcttcgg 


ccacgtcctc 


qtCQatqqcC 


2400 


cccttgacct 


gcgagaagca 


ccactggaag 


tcgttgccgc 


ccgcggggca 


gccgccgcct 


2460 


ccggctcctg 


ccatggcagc 


cggcggcggg 


acgggcgggg 


o ^ wy i^y y %»ciy 


accagggaga 


2520 


cggcctccgc 


gccgctgcag 


ccgagcgctc 


agccccgggg 


ccgccaccac 


cgccggcgcc 


2580 


gccgccgccg 


ccgccgccgc 


cggggaggga 


tyttyykct 






2619 


<210> 30 
<211> 2564 
<212> DNA 
<213> Homo sapien 












<220> 

<221> misc_feature 
<222> (554) . . (554) 
<223> n=a, c, g or t 












<400> 30 
actggtgtgt 


agttcttaaa 


gagcagttat 


cactgttccc 


tgccttgtgg 


tgtcatcgga 


60 


tgtgtccccc 


tcgctccaac 


tgagctgtac 


agcgagtgac 


ccaggctggg 


actgacccct 


120 


gctgagaagt 


accaggcggt 


cttgtccctc 


tgcctgaagc 


ccagacagtg 


ctctcaagca 


180 


taactgatgt 


cccgcccatc 


agatcctccc 


ggccaacttc 


ttggcccaga 


cagaggccct 


240 


gatgagggga 


aaatcgacgg 


aggaggcccg 


aaaggagctc 


caggctgcgg 


gcaagagtcc 


300 
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agaggacctt 


gagaggctgc 


tgccacataa 


ggtcagcact 


tctgcatttg 


gctttggggt 


360 


gcatgctgga 


gttggaggtg 


tgaagctatg 


gcaccaggca 


ggggcttggg 


gcatgcctgg 


420 


cttgtcctac 


agaatgggag ggcctgtcct 


cacagacctg 


cacgtctcag 


cctctggggc 


480 


agggcgtgct 


tccccccaag gggtttaggg 


atcaggactc 


tcttggagac 


attccttggt 


540 


gttttctgca 


ggtntttgaa ggaaatcgcc 


caaccaactc 


tattgtgttc 


accaagctca 


600 


caccattcat 


gcttggagcc 


ttggtcgcca 


tgtatgagca 


caagatcttc 


gttcagggca 


660 


tcatctggga 


catcaacagc 


ttcgaccagt 


ggggagtgga 


gctgggaaag 


cagctggcta 


720 


agaaaataga 


gcctgagctt 


gatggcagtg 


ctcaagtgac 


ctcccacgac 


gcttctacca 


780 


atgggctcat 


caacttcacc 


aagcagcagc 


gcgaggccag 


agtccaataa 


actcgtgccc 


840 


atctgcagcc 


tcctctgtga 


ctcccctttc 


tcttctcgtc 


cctcctcccc 


ggagccggca 


900 


ctgcatgttc 


ctggacacca 


cccagagcac 


cctctggttg 


tgggcttgga 


ccacgagccc 


960 


ctagcaggga 


aggctggtct 


cccccagcct 


aacccccagc 


ccctccatgt 


ctatgctccc 


1020 


tctgcgttag 


aattggctga 


agtgtttttg 


tgcagctgac 


ttttctgacc 


catgttcacg 


1080 


ttgttcacat 


cccatgcaga 


aaaataaaga 


tgccacggag 


gaggttgtag 


gctcagcctc 


1140 


tgatttttct 


tttcctgtga 


tggtgcttta 


tgtagcagag 


ggcaggagcg 


ctcagcagga 


1200 


cgcaggctgt 


gcctctgcgg 


acacttaaca 


ctaagtggtg 


agcgggtcta 


gagtggagca 


1260 


aggtgccctg 


agaagacaat 


agtggggtgg 


gggcacaacc 


agtcaggacg 


gcacagktrc 


1320 


rggctcayrc 


ctgtaatcyc agcactttgg 


gaggccragg 


yaggkggatc 


acctgaggtc 


1380 


agaccagtct 


ggtcaacatg 


gtgaaacccc 


gtctgtacca 


aaaatacaaa 


aaccagctgg 


1440 


gcatggtggc 


acctgcctgc 


ggtcccagct 


actcgggagg 


ccgaggcaga 


agactcgctg 


1500 


gaacccggga 


ggcggagctt 


gccgtgagcc 


gagactgtgt 


cactgcaccg 


cagcctgggt 


1560 


gacagagcca 


gactccatct 


caacaacaga 


cgcacttaca 


cttggagtgt 


ctgtcgcaca 


1620 


gggccgagga 


gcgcatgtca 


cacaggcgga 


tggtcccttt 


gctactgctg 


tagacgaaca 


1680 


cgttgcactg 


gtgcgggtgg 


aacccggctg 


cagtgatgac 


ttcggtcagc 


tcctccatgt 


1740 




gatgtccacg 


atgttaaagc 




f" t" ^ f^t" fifia 

^ci i> c u w uact^ 




1800 


taattctcag 


gtcatctgca 


gaaagatatg 


tttcatgatc 


actatttact 


gaaatggaat 


1860 


ttatatgata 


tgtgtgagca 


tttgcaaaaa 


ttcgccgtgg 


actcgcttct 


accataagat 


1920 


ccatgggcct 


caatattggg 


acccgtagcg 


ccgtgatcct 


aaatgggtct 


cgaagtcttc 


1980 


catcttcgtc 


tttcaggtta 


taaccttctg 


ctcttttatc 


ccgttcactt 


attttccata 


2040 


attttatagt 


tttatcattt 


gtagacagta 


gaaaatgagc 


agcattctgt 


tgtggtaacc 


2100 
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acctaatttt attaattttt tcctcaattt ctagactttt caaatagtca aactccggtt 2160 

catgactttg aaaggtgctg taaacattat attctcccct agaatgaggg cggcttttat 2220 

tctcttgttc acgctgaaaa ataacaactc tgccgccctt gtctcctgtt gcaagaagat 2280 

ctccagagta attaaactca acggtggaaa tgatgtccgc ttcggccacg tcctcgtcga 234 0 

tggccccctt gacctgcgag aagcaccact ggaagtcgtt gccgcccgcg gggcagccgc 2400 

cgcctccggc tcctgccatg gcagccggcg gcgggacggg cggggaccgc ggcagaccag 2460 

ggagacggcc tccgcgccgc tgcagccgag cgctcagccc cggggccgcc accaccgccg 2520 

gcgccgccgc cgccgccgcc gccgccgggg agggatytty ykct 2564 

<210> 31 
<211> 5718 
<212> DNA 
<213> Homo sapien 

<400> 31 

catcaatgaa aactgtgctc agcattaacc ttcagaagct gatgacactg ttctacttta 60 

agcaggaaag agaagttttc tccgcacctg ttgagggcac gtcttcctcc gcgcctgttg 120 

agggcacttc tctaaagtga atggatcctg ctttttgcgg acataggtgc gtggaattgc 180 

cagcgcctct cagaacatca agactgcact ggggagaacc cacagaaacg gacaggactg 240 

ctggccaggt ggcttccccc gcaggcgccc cagagctagg tgacgacacc tccagggaac 300 

gcgcgccgca gacctcaagg cagtggcagc tccacgggag cagcagtgcc acgtggaacc 360 

tctacacacg ggacccgcct gggctgcacc ttctgctctg ccctgggggt ggaggaggtg 420 

ggctatttag tgggaaatac attctagagg aagatggaaa agggtgtcca aatacgcagt 480 

gaaaatccag ttaaacattt aaaatttatt gaactttttg gtcaaaatag ttgaacaaat 540 
atatacaatc ccattttact agaactgtct taaggaccag ggctgactaa tgcttaaata 
actgttccac acctcagact ttgtaaattg tgtcacaatc aatatacagt atataaatta 

ttttttagtt aaaataaagg tacctttaga ttctcaattt ctaacttaat gtgaactcaa 720 

gtggcactgt gttagcattg attcagggta atgacggaca tcctgttctt gcccatgtgg 780 
attagtaatc agcacacacc ttcatctcca cagggggatg ggggatcagc acacaccttc 
atctccacag ggggaccggg gatcagcaca cgccttcatc tccacagggg gatgggggat 

cagcacacac cttcatctcc acagtgggat ggcggatcag cacacgcctt catctccaca 960 

gggggatggg ggatgagcac acgccttcat ctccacaggg ggatggcgga tcagcacacg 1020 

ccttcatctc cacaggggga cggcggatca gcacacgcct tcatctccac agggggacgg 1080 

gggatcagca cacgccttca tctccacagg aggacggggg atcagcacac accttcatct 1140 



600 
660 



840 
900 
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accQQQcratc 


agcacacacc 


ttcatctcca 


cagggggatg ggggatcagc 


1200 


;4 r*A ct t C 


atctccacag 


qqqqatgqgg 


gatcagcaca 


cgccttcatc 


tccacagggg 


1260 




cagcacacgc 


Cttcatctcc 


acagggggat 


gggggatcag 


cacacgcctt 


1320 




QtQQQatQQC 


ggatcagcac 


acgccttcat 


ctccacaggg 


ggatggggga 


1380 


tcagcacaca 


ccttcatctc 


cacaggggga 


tgggggatca 


gcacacgcct 


tcatctccac 


1440 


agtgggatgg 


cggatcagca 


cacgcct tea 


tctccacagg 


gggatggggg 


atcagcacac 


1500 


gccttcatct 


ccacagtggg 


acgqqqgatc 


agcacacacc 


tccatctcca 


cagggggacg 


1560 


ggggaccagc 


acacacctcc 


atctccacag 


gggcacgagg 


aagcagcaca 


caccttcatc 


1620 


tccacaggag gacgggggat 


cagcacacat 


cttcatctcc 


acagggggat 


gggggatcag 


1680 


cacacacctc 


catctccaca 


gggggacaag 


agatcagcac 


acaccttcat 


ccccacaggg 


1740 


ggacagggga 


tcagcatacc 


ccttcatctc 


cacaggggga 


tgggggatca 


gcacacacct 


1800 


tcatctccac 


agggggacgg 


gggatcagca 


taccccttca 


tctccacagg gggatggggg 


1860 


atcagcacac 


accttcatcc 


CCacaqqqqq 


atgggggatc 


agcacacacc 


tccatctcca 


1920 


caggggcatg 


aggaatcagc 


acacaccttc 


atctccacag 


gaagacgggg 


gatcagcaca 


1980 


ccccttcatc 


tccacagggg 


gatgggggat 


cagcacacac 


cttcatctcc 


acagggggac 


2040 


gggggatcag 


cagtacacac 


cttcatctag 


ataggggaac 


aggggatcag 


tacacacctc 


2100 


tatctccaca 


ggggcacgag 


ggatcagcac 


acacctccat 


ctccacaggg 


catgggggat 


2160 


cagcacaccc 


cttcatctcc 


acagggggac ggggatcagc 


acacaccttc 


atctccacag 


2220 


ggggacgggg gatcaacaca 


ccccttcatc 


tccacatggg 


gatgggggat 


cagcacacac 


2280 


cttcatctcc acagggggac gggggatcag cacacgcctc 


catgtccaca ggggcacggg 


2340 


ggatcagcac 


accccttcat 


cttcacaggg 


agatggggga 


tcagcacaca 


ccttcatctc 


2400 


cacaggggga 


cggggatcag 


cacacaccat 


ctccacaggg 


ggacagggga 


tcaacacacc 


2460 


ccttcatctc 


cacatgggga 


tgggggatca 


gcacacacct 


tcatctccac 


agggggatgg 


2520 


ggatcagcac 


acgcctccat 


ctccacaggg 


gcgcggggga 


ttggcacacc 


ccttcatctc 


2580 


cacagggaga 


tgggggatca 


gcacacacct 


tcatctccac 


agggggacgg gggatcagca 


2640 


caccccttca 


tctccacagt 


gggacagggg 


atcagcatat 


cccttcatct 


ccacaggggg 


2700 


atggggatca 


gcacacacct 


tcatctccac 


agggagatgg gggatcagca cgcaccttca 


2760 


tctccacagg 


ggcacggggg 


atcagcacac 


accttcatct 


ccacaggggg acgggggatc 


2820 


agcacacacc 


ttcatccaga 


caggggcatg 


aggaatcagc 


acacaccttc 


atctagacag 


2880 


ggggatggga 


tgggggatca 


gcacacacct 


ccatctccac 


aggggcacga 


gggagcagca 


2940 


cacacctcca 


tctccacagg 


gcacagggga 


tcagcacaca 


cctccttctc 


cacaggggca 


3000 
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caggcaatca 


gcacatacct 


tcatctagac 


agggtgacgg 


gcgatcagca 


cacacctcca 


3060 


tctccacagc 


agcaggagga 


atcaacacac 


accttcatct 


ccacaggcgg 


acgggggatc 


3120 


agcacacacc 


tccatctcca 


caggggcacg 


ggagcccgac 


tccccactcc 


cctcggggac 


3180 


acgtagtggg 


cgtgtgttaa 


atcccacgga 


agcaagctct 


ggtcactcag 


cattggtaca 


3240 


gctctgtatt 


atttgtcttg 


gaaaatcttg 


atttccggta 


tcaaagatcc 


tactttgaaa 


3300 


ataaaccaac 


gtactggtgg 


catcgtgtaa 


ttaaaattgc 


tcaattttgc 


tggcttgaaa 


3360 


tgcaggatga 


aatttagcat 


gcatgtaatc 


ttagcaatgt 


ttaaaaagta 


tactttcccc 


3420 


aaagtgcttt 


ttcaaaagcc 


ttagaaagca 


agcacgtatt 


tcttcaactg 


tagtgaattt 


3480 


gaagctactt 


taaaccaggc 


ttactggatg 


gcagcaattt 


tggttcttcc 


ttgcaatttt 


3540 


ctttctcaga 


ggcggcctcg 


gttctggctt 


ttcccagacc 


ttacctggtg 


tatctgatgg 


3600 


gctacttcta 


cagaaacaca 


gcgggcagca 


gcttggggac 


agaaatgctg 


cacacatacc 


3660 


cccacgctgc 


caaagaagga 


cacacatttc 


tcaaacagtt 


agttccaaag 


aagaaccctg 


3720 


cataacttca 


ttctatgcaa 


taaaattaaa 


taaaaagaca 


atacttctgt 


ttggatttaa 


3780 


aaacacttta 


tagtgtgttg 


tttttattga 


gcgctcacac 


ccgaaggggt 


ggcggcggat 


3840 


gctgtgggag 


tggagctggg 


aaagcagctg 


gctaagaaaa 


tagagcctga 


gcttgatggc 


3900 


agtgctcaag 


tgacctctca 


cgacgcttct 


accaatgggc 


tcatcaactt 


catcaagcag 


3960 


cagcgcgagg 


ccagagtcca 


acaaactcgt 


gctcatctgc 


agcctcctct 


gtgactcccc 


4020 


tttctcttct 


cgtccctcct 


ccccggagcc 


ggcactgcat 


gttcctggac 


accacccaga 


4080 


gcaccctctg 


gttgtgggct 


tggaccacga 


gcccttagca 


gggaaggctg 


gtctccccca 


4140 


gcctaacccc 


cagcccctcc 


atgtctatgc 


tccctctgtg 


ttagaattgg 


ctgaagtgtt 


4200 


tttgtgcagc 


tgacttttct 


gacccatgtt 


cacgttgttc 


acatcccatg 


tagaaaaata 


4260 


aagatgccac 


ggaggaggtt 


gcaggctcag 


cccctgattt 


tttttttcct 


gtgatggtgc 


4320 


tttatgtagc 


agagggcagg 


agcgctcagc 


aggacgcagg 


ctgtgcctct 


gcggacactc 


4380 


aacactaagt 


ggtgagcggg 


tctagagtgg 


agcaaggtgc 


cctgagaaga 


caatagtggg 


4440 


gtgggggcac 


aatcagtcag 


gacggcacag 


ktrcrggctc 


ayrcctgtaa 


tcycagcact 


4500 


ttgggaggcc 


raggyaggkg 


gaccacctga 


ggtcagacca 


gtctggtcaa 


catggtgaaa 


4560 


ccccgtctgt 


accaaaaaca 


caaaaaccag 


ctgggcatgg 


tggcacctgc 


ctgtggtccc 


4620 


agctactcgg 


gaggccgagg 


cagaagactc 


gctggaaccc 


gggaggcgga 


gcttgccgtg 


4680 


agccgagact 


gtgtcactgc 


accgcagcct 


gggtgacaga 


gccagactcc 


atctcaacaa 


4740 


cagacgcact 


tacacttgga 


gtgtctgtcg 


cacagggccg 


aggagcgcat 


gtcacacagg 


4800 
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cggatggtcc 


ctttgctact 


gctgtagacg 


aacacgttgc 


accggtgcgg gtggaacccg 


4860 


gctgcagtga 


tgacttcggt 


cagctcctcc atgttagcag gcttgatgtc cacgacgtta 


4920 


aagctcctat 


ctgtgatttc 


taagcgccat 


aaattaattc 


tcaggtcatc 


tgcagaaaga 


4980 


tatgtttcat 


gatcactatt 


tactgaaatg 


gaatttatac 


gatatgtgtg 


agcatttgca 


5040 


aaaattcgcc 


gtggactcgc 


ttctaccata 


agatccatgg 


gcttcaatat 


tgggacccgt 


5100 


agcgccgtga 


tcctaaatgg 


gcctcgaagt 


cttccatctt 


cgtctttcag 


gttataacct 


5160 


tctgctcttt 


tatcccgttc 


acttattttc 


cataatttta 


tagttttatc 


atttgtagac 


5220 


agtagaaaat 


gagcagcatt 


ctgttgtggt 


aaccacctaa 


Ctttattaat 


tttttcctca 


5280 


acttctagac 


ttttcaaata 


gtcaaactcc 


ggttcatgac 


tttgaaaggt 


gctgtaaaca 


5340 


ttatattctc 


ccctagaatg 


agggcggctt 


ttattctctt 


gttcacgctg 


aaaaataaca 


5400 


actctgccgc 


ccttgtctcc 


tgttgcaaga 


agatctccag 


agtaattaaa 


ctcaacggtg - 


5460 


gaaatgatgt 


ccgcttcggc 


cacgtcctcg 


tcgatggccc 


ccttgacctg 


cgagaagcac 


5520 


cactggaagt 


cgttgccgcc 


cgcggggcag 


ccgccgcctc 


cggctcctgc catggcagcc 


5580 


ggcggcggga 


cgggcgggga 


ccgcggcaga 


ccagggagac 


ggcctccgcg 


ccgctgcagc 


5640 


cgagcgctca 


gccccggggc 


cgccaccacc 


gccggcgccg 


ccgccgccgc 


cgccgccgcc 


5700 


ggggagggat 


yttyykct 










5718 


<210> 32 
<211> 1091 
<212> DNA 
<213> Homo sapien 












<400> 32 
gcctttgcgs 


gtggcggcga 


acgcggagag 


cacgccatga 


aggcctcggg 


cacgctacga 


60 


gagtacaagg 


tagtgggtcg 


ctgcctgccc 


acccccaaat 


gccacacgcc 


gcccctctac 


120 


cgcatgcgaa 


tctttgcgcc taatcatgtc gtcgccaagt 


cccgcttctg gtactttgta 


180 


tctcagttaa 


agaagatgaa 


gaagtcttca 


ggggagattg 


tctactgtgg 


gcaggtgttt 


240 


gagaagtccc 


ccctgcgggt gaagaacttc gggatctggc 


tgcgctatga 


ctcccggagc 


300 


ggcacccaca 


acatgtaccg ggaataccgg gacctgacca ccgcaggcgc 


tgtcacccag 


360 


tgctaccgag 


acatgggtgc 


ccggcaccgc 


gcccgagccc 


actccattca 


gatcatgaag 


>l O A 


gtggaggaga 


tcgcggccag 


caagtgccgc 


cggccggctg 


tcaagcagtt 


ccacgactcc 


480 


aagatcaagt 


tcccgctgcc 


ccaccgggtc 


ctgcgccgtc 


agcacaagcc 


acgcttcacc 


540 


accaagaggc 


ccaacaacct 


tctttctagg 


tgcagggccc 


tcgtccgggg 


cgtgccccca 


600 


aataaactca 


ggaacgcccc gaaggtacaa gctgctgttc cggatgactt 


taaggacctc 


660 
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agtctgctgt 


aacgaggagg 


cgcgctaata 


tcagttcaag 


accatggtga 


ggcgagcttg 


720 


gagcgctggc 


acgcatgacc 


cggagaagag 


caccggaaac 


cgggatgggg 


aacggctgga 


780 


tgccaagtca 


agcgccagaa 


gatgggccaa 


gcgggaccgg 


accacacgac 


gagcactgcc 


840 


cgcttaagaa 


gaataccaca 


gcaacgcgaa 


agcaacagta 


aggcagaaca 


aaccgaggcg 


900 


acaccagagc 


ggcgcgaaag 


aaaaaaacgc 


agcacaacga 


acacgcggca 


gcgcaatacg 


960 


cagcgcgctc 


taaagagacc 


gacagaaagc 


aacccgtcgg 


ggacaaccaa 


ggcgagacga 


1020 


agcccccggg 


acgcaaacgg 


gagggagaag 


agcgcaccgc 


aggtccgaat 


aaagagagga 


1080 


acagccgaca 


t 










1091 


<210> 33 

<211> 859 

<212> DNA 

<213> Homo sapien 












<400> 33 
acagccgtgt 


tgtcgccagg 


ccgcgccttc 


cctcccacag 


cgcgcgctgc 


gcgtgcgaag 


60 


gtctggcggc 


tcttgggact 


ggcggggctg 


cgcgcggggt 


tagggtgggg 


gtacgggaag 


120 


gctcaaccca 


ggacctgcgt 


accttgcttt 


gggggcgcac 


taagcacctg 


ccgggagcag 


180 


ggggcgcacc 


gggaactcgc 


agatttcgcc 


agttgggcgc 


actggggatc 


tgtggactgc 


240 


gtccggggga 


tgggctaggg 


ggacatgcgc 


acgctttggg 


ccttacagaa 


tgtgatcgcg 


300 


cgagggggag 


ggcgaagcgt 


ggcgggaggg 


cgaggcgaag 


gaaggagggc 


ctcgttcgtc 


360 


cggcccagcc 


tgaccaatgc 


cgcggtggaa 


acgggctcgg 


agctggcccc 


ataagggctg 


420 


gcggcttcct 


ccgacgccgc 


ccctccccac 


agcttctcga 


ctgcagtggg 


gcggggggca 


480 


ccaacacttg 


gagatttttc 


cggaggggag 


aggattttct 


aagggcacag 


agaatccatt 


540 


ttctacacat 


taacttgagc 


tgctggaggg 


acactgctgg 


caaacggaga 


cctatttttg 


600 


tacaaagaac 


ccttgacctg 


gggcgtaata 


aagatgacct 


ggacccctgc 


ccccactatc 


660 


tggagttttc 


catgctggcc 


aagatctgga 


cacgagcagt 


ccctgagggg 


cggggtccct 


720 


ggcgtgaggc 


ccccgtgaca 


gcccaccctg 


gggtgggttt 


gtgggcactg 


ctgctctgct 


780 


agggagaagc 


ctgtgtgggg 


cacacctctt 


caagggagcg 


tgaactttat 


aaataaatca 


840 



gttctgttta ccaaaaaaa 859 



<210> 34 

<211> 2236 

<212> DNA 

<213> Homo sapien 



<400> 34 

ctcaccccta agcccagccc tcggctccca cccttgtgta cctgggccga accattcacc 



60 
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ggagcgcgca 


gcgggtggag 


tgtggctcgg 


aggaccgcgg 


cgggtcaagc 


accttccccc 


120 


cccatatccg 


aaagcatgcc 


ctttgtccac 


gtcgtttacg 


ctcattaaaa 


cttccagaat 


180 


gcaacaggac 


ggacttggag 


tagggacaag 


gaacggaagt 


gggaagggga 


ggagcgtgca 


240 


cccctcctgg 


ccttggtgcg 


cgccgcgccc 


cctaaggtac 


ttcggaaggg 


acgcgcgggc 


300 


cagacgcgcc 


cagacggccg 


cgatggcgct 


gttggccggc 


gggctctcca 


gagggctggg 


360 


ctcccacccg 


gccgccgcag 


gccgggacgc 


ggtcgtcctc 


gtgtggcttc 


tgcttagcac 


420 


ctggtgcaca 


gctcctgcca 


gggccatcca 


ggtgaccgtg 


tccaacccct 


accacgtggt 


480 


gatcctcttc 


cagcctgtga 


ccctgccctg 


tacctaccag 


atgacctcga 


cccccacgca 


540 


acccatcgcc 


atctggaagt 


acaagtcttt 


ctgccgggac 


cgcatcgccg 


atgccttctc 


600 


cccggccagc 


gtcgacaacc 


agctcaatgc 


ccagctggca 


gccgggaacc 


caggctacaa 


660 


cccctacgtc 


gagtgccagg 


acagcgtgcg 


caccgtcagg 


gtcgtggcca 


ccaagcaggg 


720 


caacgctgtg 


accctgggag 


attactacca 


gggccggagg 


attaccatca 


ccggaaatgc 


780 


tgacctgacc 


tttgaccaga 


cggcgtgggg 


ggacagtggt 


gtgtattact 


gctccgtggt 


840 


ctcagcccag 


gacctccagg 


ggaacaatga 


ggcctacgca 


gagctcatcg 


tccttgggag 


900 


gacctcaggg 


gtggctgagc 


tcttacctgg 


ttctcaggcg 


gggcccatag 


aagaccggct 


960 


cttcgtggtt 


gtggtacgcc 


tggctgcctt 


cctcatcttc 


ctcctcctgg 


gcatctgctg 


1020 


gtgccagtgc 


tgcccgcaca 


cttgccgctg 


ctacgtcagg 


tgcccctgct 


gcccagacaa 


1080 


gtgctgctgc 


cccgaggccc 


tgtatgccgc 


cggcaaagca 


gccacctcag 


gtgttcccag 


1140 


catttatgcc 


cccagcacct 


atgcccacct 


gtctcccgcc 


aagaccccac 


ccccaccagc 


1200 


tatgattccc 


atgggccctg 


cctacaacgg 


gtaccctgga 


ggataccctg 


gagacgttga 


1260 


caggagtagc 


tcagctggtg 


gccaaggctc 


ctatgtaccc 


ctgcttcggg 


acacggacag 


1320 


cagtgtggcc 


tctgaagtcc 


gcagtggcta 


caggattcag 


gccagccagc 


aggacgactc 


1380 


catgcgggtc 


ctgtactaca 


tggagaagga 


gctggccaac 


ttcgaccctt 


ctcgacctgg 


1440 


cccccccagt 


ggccgtgtgg 


agcgggccat 


gagtgaagtc 


acctccctcc 


acgaggacga 


1500 


ctggcgatct 


cggccttccc 


ggggccctgc 


cctcaccccg 


atccgggacg 


aggagtgggg 


1560 


tggccactcc 


ccccggagtc 


ccaggggatg 


ggaccaggag 


cccgccaggg 


agcaggcagg 


1620 


cgggggctgg 


cgggccaggc 


ggccccgggc 


ccgctccgtg 


gacgccctgg 


acgacctcac 


1680 


cccgccgagc 


accgccgagt 


cagggagcag 


gtctcccacg 


agtaatggtg 


ggaggagaag 


1740 


ccgggcctac 


atgcccccgc 


ggagccgcag 


ccgggacgac 


ctctatgacc 


aagacgactc 


1800 


gagggacttc 


ccacgctccc 


gggaccccca 


ccacgacgac 


ttcaggtctc 


gggagcgccc 


1860 



wo 2004/050860 

tcctgccgac cccaggtccc accaccaccg 
gtccggggac ctcccctatg atgggcggct 
ggaggagagg aggagacccc acaaggagga 
gcccccgtac tcggagaccg actcgcaggc 
ggccctgagt cgggaaagct tagtcgtctg 
tcttttttta atttgaagga acactgatga 
aaacgtataa tcacaa 

<210> 35 

<211> 1612 

<212> DNA 

<213> Homo sapien 

<400> 35 

ggatacttga tttcggcttg tggggacagt 
gggcagcttt gttgccctgt aaacacacag 
agctggggag ttggaagttt aggtaaaacc 
tgctgttaaa tgaaacattg ttctgccacc 
tgaaaaatca cacaggaagt ggggtggggt 
ggatgctgca gaaatggaat gaggaccaac 
gcggagagga cggcttggga aggaggtcca 
tggggcagcc tggctcctgc ctcatccccc 
tgccgccggc aaagcagcca cctcaggtgt 
ccacctgtct cccgccaaga ccccaccccc 
caacgggtac cctggaggat accctggaga 
aggctcctat gtacccctgc ttcgggacac 
tggctacagg attcaggcca gccagcagga 
gaaggagctg gccaacttcg acccttctcg 
ggccatgagt gaagtcacct ccctccacga 
ccctgccctc accccgatcc gggatgagga 
gggatgggac caggagcccg ccagggagca 
ccgggcccgc tccgtggacg ccctggacga 
gagcaggtct cccacgagta atggtgggag 
ccgcagccgg gacgacctct atgaccaaga 



PCT/US2003/040063 











tacccgggac 


cctcgggaca 


acggctccag 


1920 


actggaggag 


gctgtgagga 


agaaggggtc 


1980 


ggaggaagag 


gcctactacc 


cgcccgcgcc 


2040 


gtcccgagag 


cgcaggctca 


agaagaactt 


2100 


atctgacgtt 


ctctacgtag 


cttttgtatt 


2160 


agccctgcca 


tacccctccc 


gagcccaaca 










2236 


ggtggaccca 


gcatctgggc 


tttatataaa 


60 


accatgggtg 


gccacttctt 


ccagtaagtt 


120 


ttttgattga 


caaatgttgg 


cgaattacca 


180 


ctggggctgt 


gggtgcctgc 


gtgcaccctc 


240 


ctctgtgaag 


ctggtgtccc 


ccagcctcag 


300 


agggactcag 


atgtccaagg 


aagctctaca 


360 


ggcccaggtc' 


cctccggaac 


ccaatgggta 


420 


ttctcctgtt 


gattgtgtcc 


tcacagtgta 


480 


ccccagcatt 


tatgccccca 


gcacctatgc 


540 


accagctatg attcccatgg gccctgccta 


600 


cgttgacagg 


agtagctcag 


ctggtggcca 


660 


ggacagcagt 


gtggcctctg 


aagtccgcag 


720 


cgactccatg 


cgggtcctgt 


actacatgga 


780 


acctggcccc 


cccagtggcc 


gtgtggagcg 


840 


ggacgactgg 


cgatctcggc 


cttcccgggg 


900 


gtggggtggc 


cactcccccc ggagtcccag 


960 


ggcaggcggg ggctggcggg ccaggcggcc 


1020 


cctcaccccg 


ccgagcaccg 


ccgagtcagg 


1080 


gagaagccgg 


gcctacatgc 


ccccgcggag 


1140 


cgactcgagg 


gacttcccac 


gctcccggga 


1200 
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cccccactac gacgacttca ggtctcggga 
ccaccgtacc cgggaccctc gggacaacgg 
gcggctactg gaggaggctg tgaggaagaa 

99^99^99^9 gaagaggcct actacccgcc 
gcaggcgtcc cgagagcgca ggctcaagaa 
cgtctgacct gacgttttct acgtagctct 
tgatgaagcc ctgccatacc cctcccgagt 

<210> 36 

<211> 6003 

<212> DNA 

<213> Homo sapien 

<400> 36 



tgacatacga 


aaggttttta 


atatctacaa 


caggatggaa 


agatctatgc 


cacgtaggaa 


ccataagcta 


tggagagaat 


aacagagttt 


agtctggtag 


gatgggaacc 


gttcttgaag 


taggaaatgc 


atacattttg 


tcattcacag 


ataaactcaa 


agaatttcaa 


gaatgatatt 


ggtttgggct 


ccctcctgcc 


atcgggtggg 


tcctctccca 


gtgcttattt 


cctgctggaa 


tcatccttat 


taaaatcaca 


caatgaagca 


ggcggacccc 


ggtccccgcc 


tttggccagc 


gagccccgaa 


gtgaggtcat 


gccagttagc 


ctaaagagtt 


tgacgcttgt 


gaaatataaa 


aaggaaatac 


actgccagtt 


tggtcaactt 


cactgccagt 


taagactgat 


cccctaactg 


cggcagtgac 


cccatttttt 


tttcaatcct 


gactgacccg 


aagcgcagtg 


ggaagggctt 


gggcagggga 


gcgcagaggt 


cagtgaaatc 


tccactggct 


ccctcctctg 


actgtaccag 


gctgttttcc 


ctgctcccgg 


ctgagtgaga 


ggtgggtggg 


gccctccggc 


agcctaagtg 
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gcgccctcct 


gccgacccca 


ggtcccacca 


1260 


ctccaggtcc 


ggggacctcc 


cctatgatgg 


1320 


ggggtcggag 


gagaggagga 


gaccccacaa 


1380 


cgcgccgccc 


ccgtactcgg 


agaccgactc 


1440 


gaacttggcc 


ctgagtcggg 


aaagtttagt 


1500 


tgtatttctt 


tttttaattt 


gaaggaacac 


1560 


ctaataaaac 


gtataatcac 


aa 


1612 


aagatgacag 


actgaaacag 


taacattctc 


60 


ttgtttgttc 


aaattattat 


tttttaaaag 


120 


tgttttgttt 


tttaatcctg 


atacggtgga 


180 


cttgagaata 


cattcggaac 


aaataaaaag 


240 


ttgtagggca 


ggtaagggca 


aaacgtactg 


300 


gtcatgagcc 


atgagccatg 


gaagctgcaa 


360 


gggtgtcggc 


ggaagggggt 


acgtgtcctg 


420 


ccgtggtttt 


gacgcagccg 


tgacagccct 


480 


ggtcgcccaa 


gattgaaagc 


gcgtttccag 


540 


aaaagtacat 


tctggcaggg 


agggatctct 


600 


agttagaaaa 


ggaggcattt 


cccacacttg 


660 


caggaagcgg 


gagctctcca 


aggcgacctg 


720 


gtgaatgcca 


gttctctagg 


tcgtcaaggg 


780 


cacccgaggc 


tactactctt 


cgcccagtct 


840 


tcccacagcc 


aggtcactca 


gcagtggggt 


900 


acagtcatct 


ctcaactccc 


cgaacttgca 


960 


cccagccagg 


ccttatttta 


gtcatgagtt 


1020 


cgctgctcac 


accgcgggat 


tggagtttca 


1080 


aggtcacaga 


gcccggacct 


ttccggggga 


1140 


agattgcaga 


gatctggggc 


cgctggagga 


1200 
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ggtccttccg 


ccccgggctt 


ggggctttgg 


gttagcctct 


cccccgcttc 


cctacctagg 


1260 


catctcggtt 


acaatgaaat 


taaaaacaaa 


acacaaagca 


acacaagttc 


tcattgtttt 


1320 


tcccctctgt 


cagccagtgg 


cctcgacaca 


attttcccag 


ggctgccttt 


cctttattgc 


1380 


tcaatttaac 


ctcttcctag 


aggttctgca 


agttcaaqgg 


caggatccgt 


gtcggatcca 


1440 


tcaactagtg 


cagcgcctag 


tgcagagcag 


ggggctcaaq 


aaatacgtgt 


cgaccgcaCg 


1500 


attgctaaga 


tttccctgta 


acaggcggtt 


tttttttctt 


gtctctctca 


tccctcctgc 


1560 


cctgggtgca 


gaccgcgaac 


agcgccgccg 


aacttcccaa 


cagqcaggga 


gagggcc^at 


1620 


ccgggctggc 


acgccacgtc 


cccggggcct 


cagtccctgg 


gtcaacgcag 


agaggctcgc 


1680 


ggtccgcggt 


gctacgacct 


gagcccggcc 


acgtqaagcq 


tcacgccctg 


cgctgctgca 


1740 


gggcaggcgc 


cgccgccgcg 


gttgtgtaac 


qctgcagccg 


gaggggagqc 

3 3 3 3 3 3 li? 


qaccgggqcq 


1800 


cctctgctcc 


ggagggggca 


gaggagggag 


gqagcgccqq 


gcacagcgcc 


cggtatcttc 


1860 


ctgcgcttga 


ctggccggga 


ggaagggggt 


gacactgggg 


cacccagggg 


gctgcgcaac 


1920 


cgggtgcccg 


cccgcccgca 


ggttgtgccc 


aggagcggag 


ctcctgggag 


cgaggggccc 


1980 


ccggctcaca 


ccctccacgg 


tcaggcgcgc 


gcggcgtgct 


cggcggcagc 


ctcagccgcc 


2040 


cctgcgcttg 


gccgtcggcc 


cgcttcccat 


ggggtgacat 


ccgccccgcc 


cctcggtccc 


2100 


tccccaaggc 


gggcaattcc 


tqqacgcgag 


ggtgagcaqt 

33 3 3 3 


ggqqqcaqqg 

33333^333 


aagccaggac 


2160 


ggaaagaaac 


cccagcctcc 


ggggaaggct 


999999ccgg 

3333^3 33 


gcgactccct 


ccgcaqcqcq 


2220 


ccggttcctc 


caagcgggcc 


gggcggggga 


ggaggaagag 


ggctgggctg 

33 333 3 


gagctagcaa 


2280 


ggggacattc 


ctctcccggc 


ttgagtcaga 


cgcgggcgga 


tccgtcctcc 


cccgttccct 


2340 


cccaggagac 


gggaacctac 


ttcatttccc 


tgggqcaggt 

^^^^^^ 


tcgcccacgt 


taccaacttc 


2400 


tccccctccc 


ccagcacccc 


cgtcccttcc 


agcttccgcg 


ccccccaccc 


aactqqqcaq 


2460 


gacccaggtc 


gtgctgccac 


cccctcttcg 


gggaaaggcg 


gccgcagccg 


cagacacctg 


2520 


ggggccgggg 


ctgggggtgg 


gggctcc.cta 


gcagccgccg 


gagcQttgtc 


caacacgtga 


2580 


gactcatgtg 


atgaagccgg 


gggagggcgg 


gcaggtcgct 


ccttccctcc 


ccagcagtqg 

^^33333 


2640 


ccagacgtgc 


ctggagtcac 


agggtagaac 


acgtagctcc 


aacccaccca 


ccgctccctc 


2700 


ccgcctcccc 


gcccgcccca 


cttctcattc 


acttggctcg 


cacqqcqcaq 


acagaccgcg 


2760 


cagggagcac 


acaccgccag 


tctgtgcgct 


gagtcggagc 


cagaggccgc 


ggggacaccg 


2820 


ggccatgcac 


gcccccaact 


gaagctgcat 


ctcaaagccg 


aagattccag 


cagcccaggg 


2880 


gatttcaaag agctcagact 


cagaggaaca 


tccgcggaga 


gacccccgaa 


gccctctcca 


2940 


gggcagtcct 


catccagacg 


ctccgctagt 


gcagacagga 


gcgcgcagtg 


gccccggctc 


3000 


gccgcgccat 


ggagcggatc 


cccagcgcgc 


aaccaccccc 


cgcctgcctg 


cccaaagcac 


3060 
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cgggactgga 


gcacggagac 


ctaccaggga 


tgtaccctgc 


ccacatgtac 


caagtgtaca 


3120 


agtcaagacg 


gggaacaaag 


cggagcgagg 


acagcaaggt 


aagcaagtgc 


acccctaggg 


3180 


accctgcgct 


cagcccctcg 


cgcgcgctga 


gtttccaaga 


aaagttttct 
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3240 


ttggcgaggg 
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gtgggctcgg 
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3300 


ggcagacgac 


ggccctgggt 


gctactctgc 
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ggggctggga 


ggataggctt 


ctcatctcct 


tccccaagcg 
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gctcatcgag 


aaaaagagac 
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3480 
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cctacccgaa 


catctcaaac 
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gcagtgagca 


gccgtgcttc 
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tcggcgcaat 
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4140 


aaagaaccgg 
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ccacctcagt 
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4380 


agacaagatc 


tcggctccct 
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ccttctccct 
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tccgtccgtc 
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tcttgctcca 
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tctaggggat 


cctgctgctt 
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gagataatct 


gaggcatgga 
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agggtgtgtg 


tgtgtgtgtg 


tgtgtgtgtg 
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tgtatgtgcg 


tgtgcgtgca 


catgtgtgcc 


tgcgtgttgg 
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taaagctcct 
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tttggcatag 


ggaagtcacg 


aaggattgct 


tgacatcagg 


agacttgggg 


gggattgtag 
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cagacgtctg 


ggcttttccc 


cacccagaga 


atagccccct 


ccgatacaca 


tcagctggat 
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1 A Q A 


gctgttttcc 


ctgctcccgg 


ctgagtgaga 


aggtcacaga 


gcccggacct 


ttccggggga 


1140 


ggtgggtggg 


gccctccggc 


agcctaagtg 


agattgcaga 


gatctggggc 


cgctggagga 


1 O A A 


ggtccttccg 


ccccgggctt 


ggggctttgg 


gttagcttct 


cccccgcttc 


tctacctagg 


1260 


catctcggtt 


acaatgaaat 


taaaaacaaa 


acacaaagca 


acacaagtcc 


tcattgtttt 


1320 


tcccctctgt 


cagccagtgg 


cctcgataca 


attttcccag 


ggctgccttt 


cctttattgc 


1380 


tcaatttaac 


ctcttcctag 


aggttctgca 


agttcaaggg 


caggatccgt 


gtcggattca 


1440 


tcaactagtg 


cagcgcctag 


tgcagagcag 


ggggctcaag 


aaatacgtgt 


cgaccgcatg 


1500 


attgctaaga 


tttccctgca 


acaggcggtt 


tttttttctt 


gtctctctca 


tccctcctgc 


1560 


cctgggtgca 


gaccgcgaac 


agcgccgccg 


aacttcccaa 


caggcaggga gagggccgat 


1620 


ccgggctggc 


acgccacgtc 


cccggggtct 


cagtccctgg 


gtcaacgcag 


agaggctcgc 


1680 
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ggtccgcggt 


gctacgacct 


gagcccggcc 


gggcaggcgc 


cgccgccgcg 


gttgtgtaac 


cctctgctcc 


ggagggggca 


gaggagggag 


ctgcgcttga 


ctggccggga 


ggaagggggt 


cgggtgcccg 


cccgcccgca 


ggttgtgccc 


tcggctcaca 


ccctccacgg 


tcaggtgcgc 


cctgcgcttg 


gccgtcggcc 


cgcttcccat 


tccccaaggc 


gggcaattcc 


tggacgcgag 


ggaaagaaac 


cccagcctct 


ggggaaggct 


ccggttcctc 


caagcgggcc 


gggcggggga 


ggggatattc 


ctctcccggc 


ttgagtcaga 


cccaggagac 


gggaacttac 


ttcatttccc 


tccccctccc 


ccagcacccc 


cgtcccttcc 


gacccaggtc 


gtgctgccac 


cccctcttcg 


ggggccgggg 


ctgggggtgg 


gggctcccta 


gactcatgtg 


atgaagccgg 


gggagggcgg 


ccagacgtgc 


ctggagtcac 


agggcagaac 


ccgcctcccc 


gcccgcccca 


cttctcattc 


cagggagcac 


acaccgccag 


tctgtgcgct 


ggccatgcac 


gcccccaact 


gaagctgcat 


gatttcaaag 


agctcagact 


cagaggaaca 


gggcagtcct 


catccagacg 


ctccgctagt 


gccgcgccat 


Sgagcggatc 


cccagcgcgc 


cgggactgga 


gcacggagac 


ctaccagggt 


aactgcagcc 


ccatgttatg 


cgccactctc 


gtccttgccc 


ggcagaacgc 


ctgcagccgt 


tcttcctgca 


ggatgtaccc 


tgcccacatg 


aagcggagcg 


aggacagcaa 


ggtaagcaag 


tcgcgcgcgc 


tgagtttcca 


agaaaagttt 


ggtgtgggct 


cggtgcctct 


tctgagtgac 
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acgtgaagcg tcacgccctg cgctgctgca 1740 

gctgcagccg gaggggaggc gaccggggcg 1800 

ggagcgccgg gcacagcgcc cggtatcttt 1860 

gacactgggg cacccagggg gctgcgcaac 1920 

aggagcggag ctcctgggag cgaggggccc 1980 

gcggcgtgct cggcggcagc ctcagctgcc 204 0 

ggggtgacat ccgccccgcc cctcggtccc 2100 

ggtgagcagc gggggcaggg aagccaggac 2160 

ggggggccgg gcgactccct ccgcagcgcg 2220 

ggaggaagag ggctgggctg gagctagcaa 2280 

cgcgggcgga tccgtcctcc cccgttccct 2340 

tggggcaggt tcgcccacgt taccaacttc 24 00 

agcttccgcg ccccccaccc aactgggcag 2460 

gggaaaggcg gccgcagccg cagacacctg 2520 

gcagccgccg gagcgttgtc caacacgtga 2580 

gcaggtcgct ccttccctcc ccggcagtgg 264 0 

acgtagctcc aacccaccca ccgctccctc 2700 

acttggctcg cacggcgcag acagaccgcg 2760 

gagtcggagc cagaggccgc ggggacaccg 2820 

ctcaaagccg aagattccag cagcccaggg 2880 

tccgcggaga gacccccgaa gccctctcca 294 0 

gcagacagga gcgcgcagtg gccccggctc 3000 

aaccaccccc cgcctgcctg cccaaagcac 3060 

aagttggcac tccttggccc ttcaagcctc 3120 

aacttggagc agctccgggc gcaccgggag 3180 

gcgcaagcag tcacgaccct tcccggtctc 3240 

taccaagtgt acaagtcaag acggggaata 3300 

tgcaccccta gggaccccgc gctcagcccc 3360 

tctcgctttg aggttggcga ggggatgcag 3420 

ttggaaaaga aagggcagac gatggttctg 34 80 
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ggtgctactc 


tgctactctg 


ctcctctgcc 


ggactgttgc 


tgcggggctg 


ggaggatagg 


3540 


cttctcatct 


ccttccccaa 


gcgcccaccg 


cctccccgtg 


cgtcttgcag 


gagacctaca 


3600 


aactgccgca 


ccggctcatc 


gagaaaaaga 


gacgtgaccg 


gattaacgag 


tgcatcgccc 


3660 


agctgaagga 


tctcctaccc 


gaacatctca 


aacttacagt 


aagtgagaag 


ctggcccctt 


3720 


tccaacccag 


tcctttgccc 


agagggcggg 


cgggtcactc 


gccgcctgca 


ggttccgcgg 


3780 


ggaactgcag 


accagactgg 


cggatcagag 


ctggcgggtg 


gctccgcggt 


ggggtcctcc 


3840 


tccacagtcc 


tggcgttttc 


tggctgactc 


atgcaaatga 


ggcgagaacc 


acttggacct 


3900 


ctccaagttg 


gctccaacac 


caactggcct 


tcctttccag 


gagtttaact 


tctgactctg 


3960 


ggtcacttgg 


aaaaagcagt 


ggttcttgaa 


cttaccttga 


agcatgtgaa 


agcactaaca 


4020 


aacctaactg 


atcagcagca 


gcagaaaatc 


attgccctgc 


agagtggttt 


acaagctggt 


4080 


gagtgctgat 


tctggctatg 


ctctctttaa 


gagtttgagt 


gcaaatagag 


agcccgtggg 


4140 


ctcaacatct 


gatgtaataa 


acattactgc 


aacaaattgc 


ttatgcacgt 


tcattggggg 


4200 


agatgcagtt 


ttatctttcc 


taaaaggtca 


gacttttcat 


ttgaaatgca 


ttgtgccacc 


4260 


tggcatgaat 


acgtattagt 


gaaatgataa 


gatattcctg 


tggcatttta 


gcttgagagt 


4320 


ctctgggttc 


tgggagattt 


ccctactccc 


ttttaaaatg 


ccaaggcacc 


tataatttgc 


4380 


attgtgggtg 


ttttacagtt 


ggtcttctta 


gccctctttc 


attggagaag 


aaatcatggg 


4440 


tatgccccag 


cgttttcctg 


tttctttgct 


tccttgtcac 


ttggcgtgtg 


gtgtgatcct 


4500 


ggcaggatac 


agcaggagtt 


tttcctgcat 


ttcatgtggt 


tccctgaata 


gccactaggt 


4560 


atcttggtga 


attgcacaat 


ctagcgaaac 


cactgaagtt 


tcatcattcg 


tttctactga 


4620 


cttccagaca 


atcggagtca 


accttctaac 


ctcctagtct 


cacttcttcc 


aaataatact 


4680 


gtacagactg 


gggagaatta 


ttctacccac 


tccctcattt 


catgcttgtc 


tgccttcctc 


4740 


tcgaaggctc 


gtatgatgaa 


aattgcaaaa 


acccagctag 


tgcttctgcc 


tgtttcctca 


4800 


ttcaaaatag 


gaagatggtc 


atacctaatg 


tcttgaggat 


catggataga 


atatgcggag 


4860 


aagacataga 


aatagcctaa 


ttaaaggtca 


cagcatgact 


gcaactttta 


caaatcattg 


4920 


aaaataagtt 


tactttaggt 


gattgttgtt 


tcttcattct 


tttcctcatt 


tcataactat 


4980 


ccagcctctg 


cccattttgc 


cacggtgcaa 


tgttgtatgc 


attgactctt 


aggtttataa 


5040 


acacaucaga 


atguucuu ug 








cacctataca 


5100 


atccaaaaga 


ggggaaacag 


ctcatctcat 


attaactatg 


gtccatttcc 


atgataggat 


5160 


attatgcaac 


cattaaaaat 


catattttac 


ctgagtgtta 


gtatatggat 


tattttaatt 


5220 


atttatactt 


ttcagtgtaa 


ttcttctctg 


agcatactac 


ttttggagtt 


aggaacacaa 


5280 


atcagtaaat 


gcattttttt 


ttttaacaaa 


aactaaagca 


cctggttttc 


tggagtatag 


5340 
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tttccaggtg 


cgccacaggc 


cttctgaaac 


gagctgtcag 


ggagaaatgt 


cgaaacaggc 


tgtgcccggg 


aggtgcttca 


gcatctggcc 


tcgcagcttg 


tcacccacct 


ccaccgggtg 


aggaagccat 


cagacccagc 


tcccaaagtg 


gccaaaggct 


cggaaggtcc 


tgggaaaaac 


cactcgagtg 


gggagcagag 


cggcagcgac 


tcggagaagg 


gcgacttgcg 


cagtgagcag 


ttcacgatgg 


gagaaaggat 


cggcgcaatt 


aagaaccgga 


tgcagctttc 


ggatgatgaa 


tccccgttcc 


tgggcccaca 


cccacaccag 


ccaccttcag 


cgactgccta 


cctgcccatg 


ccagcgctat 


acccaggcct 


caacgcctct 


gacaagatct 


cggctccctt 


gctcatgccc 


ccgtccgtcg 


actcttctgt 


cttgctccaa 


gaaaccaaag 


actaaactcc 


ccaggggatc 


cctaaaaagc 


aacaaaaaag 


tttttgtgaa 


agataatctg 


aggcatggag 


agcagattca 


gtatgtgcgt 


gtgcgtgcac 


atgtgtgcct 


ttggcatagg 


gaagtcacga 


aggactgctt 


agacgtctgg 


gcttttcccc 


acccagagaa 


ttcaaaagct 


tcaaagtctt 


ggtccgtgag 


gctttgatca 


gaaggtactt 


tcaaaagagg 


ttaggacccc 


acccttttgc 


ctttattgtc 


agcagtcaga 


ccgagctttc 


tgctaacatg 


agtggtttgg 


ggaggtttcc 


gcaggcctgc 


aatgtagttc 


cctactcagg 


ctttcgtagt 


cccttgtaca 


agctgagccg 


ccccggaggg 


gtttctaggc 


ctaaccacta 


gtacttactg 


gcggacaccc 


cgagcgagca 


ccccaaagtg 
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gttttccttc 


ggattcttcc 


ttccccaggt 


5400 


caagagacgc 


tctgctcagg 


tttccagaca 


5460 


aagcacgaga 


acactcggga 


cctgaagtct 


5520 


gtctcggagc 


tgctgcaggg 


tggtacctcc 


5580 


atggacttca 


aggaaaaacc 


cagctctccg 


5640 


tgcgtgccag 


tcatccagcg 


gactttcgct 


5700 


acggacacag 


acagtggcta 


tggaggagaa 


5760 


ccgtgctcca 


aaagtgacca 


cggaegcagg 


5820 


aagcaagagt 


ccgaagaacc 


ccccacaaaa 


5880 


ggccatttca 


ctagcagtga 


cctgatcagc 


5940 


cctcctttct 


gcctgccctt 


ctacctgatc 


6000 


ctggagaagt 


gctggtatcc 


cacctcagtg 


6060 


gccgcagccc 


tctctagctt 


catgaaccca 


6120 


cagagacccc 


cttctccctt 


gccagctcat 


6180 


gctctgaagc 


caatcccccc 


tttaaactta 


6240 


ctgctgcttt 


gctttccttc 


ctcgctactt 


6300 


tgctgcaaga 


ttgttgcatt 


gtgtatactg 


6360 


gggtgcgcgt 


gtgtgtgtgt 


gtgtgtgtgt 


6420 


gcgtgttggt 


ataggacttt 


aaagctcctt 


6480 


gacatcagga 


gacttggggg 


ggattgtagc 


6540 


tagccccctt 


cgacacacat 


cagctggatt 


6600 


tcactcttca 


gtttgggagc 


tgggtetgtg 


6660 


gctttccagg 


get cage tec 


caaccagctg 


6720 


gacgtgactc 


accagaegtc 


ggggagagag 


6780 


gggaggtagc 


aggcactggc 


atagcacggt 


6840 


tccccacccc 


tgcctcggaa 


gaataaagag 


6900 


gattagctta 


etaaggaact 


gaaaatgggc 


6960 


agggaggagt 


tccctgggct 


tctggcacct 


7020 


tgcagggaac 


caaaccaagg 


tctgagaaat 


7080 


cacaaagctg 


agtaaaaagc 


tgcccccttc 


7140 
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aaactcaat t 


ttcaattcca 


Ccctaaaacc 


ccttttaacc 


aagctcagct 


7200 


w ^ wdva^^ w 


ctaaccaagc 


cttggcaccg 


ccagatcctt 


tctgtaggct 


aattcctctt 


7260 


CI r" o a a r* a a c 


o w w 3 3 ^ z3 


tccttattgc 


taaaaaggaC 


tccgtctcct 


ccaaagaagt 


7320 


tttatttttci 


gtccagagCa 


cttgttttcc 


cgatgtgtcc 


agccagctcc 


gcagcagctt 


7380 




act a taccta 

d ^ 0 W ~J w \— 1— ^ 


attgctgatc 


gtgttttaac 


tttttctttt 


cctgttttta 


7440 






r f f ^ a f t tat. 


Wi M C4 W ^ ^ ^ ^ W4 


taaacatata 


ttatataaat 


7500 


atattaaaaa 


ggaaaatgct 


tcagatgttt 


atttgtataa 


ttacttgatt 


cacacagtga 


7560 


gaaaaaatga 


atgtattcct 


gtttttgaag 


agaagaataa 


tttttttttt 


ctctagggag 


7620 


aggtacagtg 


tttatatttt 


ggagccttcc 


tgaaggtgta 


aaattgtaaa 


tatttctatc 


7680 


tatgagtaaa 


tgttaagtag 


ttgttttaaa 


atacttaata 


aaataattct 


tttcctgtgg 


7740 


aa 












7742 


<210> 39 

<211> 5384 

<212> DNA 

<213> Homo sapien 












<400> 39 
tgacatacga 


aaggttttta 


atatctacaa 


aagatgacag 


actgaaatag 


taacattctc 


60 


caggatggaa 


agatctatgc 


cacgtaggaa 


ttgtttgttc 


aaattattat 


tttttaaaag 


120 


ccataagcta 


tggagagaat 


aacagagttt 


tgttttgttt 


tttaatcctg 


atacggtgga 


180 


agtctggtag 


gatgggaacc 


attcttaaaQ 


cttgagaata 


cattcggaac 


aaataaaaag 


240 


taggaaatgc 


atacattttg 


ttatt tacaa 


ttataaaoca 


QQtaaqQQca 


aaacgtattg 


300 


ataaactcaa 


agaattttaa 


gaatigatatti 


QtcatcraQCC 


atgagccatg 


gaagctgcaa 


360 


ggtttgggct 


ccctcctgcc 


atCqqQtQQQ 


gggtQtcggc 


ggaagggggt 


acgtgtcctg 


420 


tcctctccca 


gtgcttattt 


cctgctggaa 


ccgtggtttt 


gacgcagccg 


tgacagccct 


460 


tcatccttat 


taaaatcaca 


caatgaagca 


ggtcgcccaa 


gattgaaagc 


gcgtttccag 


54 0 


ggcggacccc 


ggtccccgcc 


tt taaccaac 


aaaagtacat 


tctggcaggg agggatctct 


600 


gagccccgaa 


gtgaggtcat 


gccagt tagc 


agttagaaaa 


ggaggcattt 


ctcacacttg 


660 


ctaaagagtt 


tgacgcttgt 


gaaatataaa 


caggaagcgg 


gagctctcca 


aggcgacctg 


ion 


aaggaaatac 


actgccagtt 


tggtcaactt 


gtgaatgcca 


gttctctagg 


tcgtcaaggg 


780 


cactgccagt 


caagactgat 


cccctaactg 


cacccgaggc 


tactactctt 


cgcccagtct 


84 0 


cggcagtgac 


cccattcttt 


tttcaatcct 


tcccacagcc aggtcactca gcagtggggt 


900 


gactgacccg 


aagcgcagtg ggaagggctt 


acagtcatct 


ctcaactccc 


cgaacttgca 


960 
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gggcagggga 


gcgcagaggt 


cagtgaaatc 


cccagccagg 


ccttatttta 


gtcatgagtt 


1020 


tccactggct 


cccccctctg 


actgtatcag 


cgctgctcac 


accgcgggat 


tggagtttca 


1080 


gctgttttcc 


ctgctcccgg 


ctgagtgaga 


aggtcacaga 


gcccggacct 


ttccggggga 


1140 


ggtgggtggg 


gccctccggc 


agcctaagcg 


agattgcaga 


gatctggggc 


cgctggagga 


1200 


ggtccttccg 


ccccgggctt 


ggggctttgg 


gttagcctct 


cccccgcttc 


tctacctagg 


1260 


catctcggtt 


acaatgaaat 


taaaaacaaa 


acacaaagca 


acacaagttc 


tcattgtttt 


1320 


tcccctctgt 


cagccagtgg 


cctcgataca 


attttcccag 


ggctgccttt 


cctttattgc 


1380 


tcaatttaac 


ctcttcctag 


aggttctgca 


agttcaaggg 


caggatccgt 


gtcggattca 


1440 


tcaactagtg 


cagcgcctag 


tgcagagcag 


ggggctcaag 


aaatacgtgt 


cgaccgcatg 


1500 


atcgccaaga 


tttccctgta 


acaggcggtt 


tttctttctt 


gtctctctca 


tccctcctgc 


1560 


cctgggtgca 


gaccgcgaac 


agcgccgccg 


aacttcccaa 


caggcaggga 


gagggccgat 


1620 


ccgggctggc 


acgccacgtc 


cccggggtct 


cagtccctgg 


gtcaacgcag 


agaggctcgc 


1680 


ggtccgcggt 


gctacgacct 


gagcccggcc 


acgtgaagcg 


tcacgccccg 


cgctgctgca 


1740 


gggcaggcgc 


cgccgccgcg 


gttgtgtaac 


gccgcagccg 


gaggggaggc 


gaccggggcg 


1800 


cctctgctcc 


ggagggggca 


gaggagggag 


ggagcgccgg 


gcacagcgcc 


cggtatcttt 


1860 


ctgcgcttga 


ctggccggga 


ggaagggggt 


gacactgggg 


cacccagggg 


gctgcgcaac 


1920 


cgggtgcccg 


cccgcccgca 


ggttgtgccc 


aggagcggag 


ctcctgggag 


cgaggggccc 


1980 


tcggctcaca 


ccctccacgg 


tcaggtgcgc 


gcggcgtgct 


cggcggcagc 


ctcagctgcc 


2040 


cctgcgcttg 


gccgtcggcc 


cgcttcccat 


ggggtgacat 


ccgccccgcc 


cctcggtccc 


2100 


tccccaaggc 


gggcaattcc 


tggacgcgag 


ggtgagcagt 


gggggcaggg 


aagccaggac 


2160 


ggaaagaaac 


cccagcctct 


ggggaaggct 


ggggggCCgg 


gcgactccct 


ccgcagcgcg 


2220 


ccggttcctc 


caagcgggcc 


gggcggggga 


ggaggaagag 


ggctgggctg 


gagctagcaa 


2280 


ggggatattc 


ctctcccggc 


ttgagtcaga 


cgcgggcgga 


tccgtcctcc 


cccgttccct 


2340 


cccaggagac 


gggaacttac 


ttcatttccc 


tggggcaggt 


tcgcccacgt 


taccaacttc 


2400 


tccccctccc 


ccagcacccc 


cgtcccttcc 


agcttccgcg 


ccccccaccc 


aactgggcag 


2460 


gacccaggtc 


gtgctgccac 


cccctcttcg 


gggaaaggcg 


gccgcagccg 


cagacacctg 


2520 


ggggccgggg 


ctgggggtgg 


gggctcccta 


gcagccgccg 


gagcgttgtc 


caacacgtga 


2580 


gactcatgtg 


atgaagccgg 


gggagggcgg 


gcaggtcgct 


ccttccctcc 


ccggcagtgg 


2640 


ccagacgtgc 


ctggagtcac 


agggtagaac 


acgtagctcc 


aacccaccca 


ccgctccctc 


2700 


ccgcctcccc 


gcccgcccca 


cttctcattc 


acttggctcg 


cacggcgcag 


acagaccgcg 


2760 


cagggagcac 


acaccgccag 


tctgtgcgct 


gagtcggagc 


cagaggccgc 


ggggacaccg 


2820 
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ggccatgcac 


gcccccaact 


gaagctgcat 


ctcaaagccg aagattccag 


cagcccaggg 


2880 


gatttcaaag 


agctcagact 


cagaggaaca 


tctgcggaga gacccccgaa 


gccctctcca 


2940 


gggcagtcct 


catccagacg 


ctccgctagt 


gcagacagga gcgcgcagtg 


gccccggctc 


3000 


gccgcgccat 


ggagcggatc 


cccagcgcgc 


aaccaccccc cgcctgcctg 


cccaaagcac 


3060 


cgggactgga 


gcacggagac 


ctaccaggga 


tgtaccctgc ccacatgtac 


caagtgtaca 


3120 


agtcaagacg 


gggaataaag 


cggagcgagg 


acagcaagga gacctacaaa 


ttgccgcacc 


3180 


ggctcatcga 


gaaaaagaga 


cgtgaccgga 


ttaacgagtg catcgcccag 


ctgaaggatc 


3240 


tcctacccga 


acacctcaaa 


cttacaactt 


tgggtcactc ggaaaaagca 


gtggttcttg 


3300 


aacttacctt 


gaagcatgtg 


aaagcactaa 


caaacctaat tgatcagcag 


cagcagaaaa 


3360 


tcattgccct 


gcagagtggt 


ttacaagctg 


gtgagctgtc agggagaaat 


gtcgaaacag 


3420 


gtcaagagat 


gttctgctca 


ggtttccaga 


catgtgcccg ggaggtgctt 


cagtatctgg 


3480 


ccaagcacga 


gaacactcgg gacctgaagt 


cttcgcagct tgtcacccac 


ctccaccggg 


3540 


tggtctcgga 


gctgctgcag ggtggtacct 


ccaggaagcc atcagaccca 


gctcccaaag 


3600 


tgatggactt 


caaggaaaaa 


cccagctctc 


cggccaaagg ttcggaaggt 


cctgggaaaa 


3660 


actgcgtgcc 


agtcatccag 


cggactttcg 


ctcactcgag tggggagcag 


agcggcagcg 


3720 


acacggacac 


agacagtggc 


tatggaggag 


aatcggagaa gggcgacttg 


cgcagtgagc 


3780 


agccgtgctt 


caaaagtgac 


cacggacgca 


ggttcacgat gggagaaagg 


atcggcgcaa 


3840 


ttaagcaaga 


gtccgaagaa 


ccccccacaa 


aaaagaaccg gatgcagctt 


tcggatgatg 


3900 


aaggccattt 


cactagcagt 


gacctgatca 


gctccccgtt cctgggccca 


cacccacacc 


3960 


agcctccttt 


ctgcctgccc 


ttctacctga 


tcccaccttc agcgactgcc 


tacctgccca 


4020 


tgctggagaa 


gtgctggtat 


cccacctcag 


tgccagtgct atacccaggc 


ctcaacgcct 


4080 


ctgccgcagc 


cctctctagc 


ttcatgaacc 


cagacaagat ctcggctccc 


ttgctcatgc 


4140 


cccagagact 


cccttctccc 


ttgccagctc 


atccgtccgt cgactcttct 


gtcttgctcc 


4200 


aagccctgaa 


gccaatcccc 


cctttaaact 


tagaaaccaa agactaaact 


ctctagggga 


4260 


tcctgctgct 


ttgctttcct 


tcctcgctac 


ttcctaaaaa gcaacaaaaa 


agtttttgtg 


4320 


aatgctgcaa 


gattgttgca 


ttgtgtatac 


tgagataatc tgaggcatgg 


agagcagatt 


4380 


cagggtgtgt 


gtgtgtgtgt 


gtgtgtgtgt 


gtgtatgtgc gtgtgcgtgc 


acatgtgtgc 


4440 


ctgcgtgttg 


gcataggact 


ttaaagctcc 


ttttggcata gggaagtcac 


gaaggattgc 


4500 


ttgacatcag 


gagacttggg ggggattgta 


gcagacgtct gggcttttcc 


ccacccagag 


4560 


aatagccccc 


ttcgatacac 


atcagctgga 


ttttcaaaag cttcaaagtc 


ttggtctgtg 


4620 
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agtcaccctt 


cagtttggga 


gctgggtctg 


49 

tggctttgat 


cagaaggtac 


tttcaaaaga 


4680 


gggctttcca 


gggctcagct 


cccaaccagc 


tgttaggacc 


ccaccctttt 


gcctttattg 


4740 


tcgacgtgac 


tcaccagacg 


tcggggagag 


agagcagtca 


gaccgagctt 


tctgctaaca 


4800 


tggggaggta 


gcaggcactg 


gcatagcacg 


gtagtggttt 


ggggaggttt 


ccgcaggtct 


4oo0 


gctccccacc 


cctgcctcgg 


aagaataaag 


agaatgtagt 


tccctactca 


ggctttcgta 


4920 


gtgattagct 


tactaaggaa 


ctgaaaatgg 


gccccttgta 


caagctgagc 


cgccccggag 


4 980 


ggagggagga 


gttccctggg 


cttctggcac 


ctgtttctag 


gcctaaccat 


cagtacttac 


5040 


tgtgcaggga 


accaaaccaa 


ggtctgagaa 


atgcggacac 


cccgagcgag 


caccccaaag 


5100 


cgcacaaagc 


tgagtaaaaa 


gctgccccct 


tcaaacagaa 


ctagactcag 


ttttcaatCc 


5160 


catcctaaaa 


ctccttttaa 


ccaagcttag 


cttctcaaag 


gcctaaccaa 


gccttggcac 


5220 


cgccagatcc 


tttctgtagg 


ctaattcctc 


ttgcccaacg 


gcatatggag 


tgtccttatt 


5280 


gctaaaaagg 


attccgtctc 


cttcaaagaa 


gttttatttt 


tggtccagag 


tacttgtttt 


5340 


cccgatgtgt 


ccagccagaa 


ccacaaatat 


cggggtacga 


gctc 




5384 


<210> 40 

<211> 5356 

<212> DNA 

<213> Homo sapien 












<400> 40 
tgacatacga 


aaggttttta 


atatctacaa 


aagatgacag 


actgaaatag 


taacattctc 


60 


caggatggaa 


agatctatgc 


cacgtaggaa 


ttgtttgttc 


aaatcattat 


tttttaaaag 


120 


ccataagcta 


tggagagaat 


aacagagttt 


tgttttgtct 


tttaatcctg 


atacggtgga 


180 


agtctggtag 


gatgggaacc 


gttcttgaag 


cttgagaata 


cattcggaac 


aaataaaaag 


240 


taggaaatgc 


atacattttg 


ctatttacag 


ttgtagggca 


ggtaagggca 


aaacgtattg 


300 


ataaactcaa 


agaattttaa 


gaatgatatt 


gtcatgagcc 


atgagccatg 


gaagctgcaa 


360 


ggtttgggct 


ccctcctgcc 


atcgggtggg 


gggtgtcggc 


ggaagggggt 


acgtgtcctg 


420 


tcctctccca 


gtgcttattt 


cctgctggaa 


ccgtggtttt 


gacgcagccg 


tgacagccct 


480 


tcatccttat 


taaaatcaca 


caatgaagca 


ggtcgcccaa 


gattgaaagc 


gcgtttccag 


540 


ggcggacccc 


ggtccccgcc 


tttggccagc 


aaaagtacat 


tctggcaggg 


agggatctct 


600 


gagccccgaa 


gtgaggtcat 


gccagttagc 


agttagaaaa 


ggaggcattt 


ctcacacttg 


660 


ctaaagagtt 


tgacgcttgt 


gaaatacaaa 


caggaagcgg 


gagctctcca 


aggcgacctg 


720 


aaggaaatac 


actgccagtt 


tggtcaactt 


gtgaatgcca 


gttctctagg 


tcgtcaaggg 


780 


cactgccagt 


taagactgat 


cccctaactg 


cacccgaggc 


tactactctt 


cgcccagtct 


840 
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cggcagtgac 


cccatttttt 


tctcaatcct tcccacagcc aggtcactca 


gcagtggggt 


900 


gactgacccg 


aagcgcagtg 


ggaagggctt acagtcatct ctcaactccc 


cgaacttgca 


960 


gggcagggga gcgcagaggt 


cagtgaaatc cccagccagg ccttatttta gtcatgagct 


1020 


tccactggct 


ccctcctctg 


actgtatcag cgctgctcac accgcgggat 


tggagtttca 


1080 


gctgttttcc 


ctgctcccgg 


ctgagtgaga aggtcacaga gcccggacct 


ttccggggga 


1140 


ggtgggtggg gccctccggc agcctaagtg agattgcaga gatctggggc 


cgctggagga 


1200 


ggtccttccg 


ccccgggctt 


ggggctttgg gttagcttct cccccgcttc 


tctacctagg 


1260 


catctcggtt 


acaatgaaat 


taaaaacaaa acacaaagca acacaagttc 


tcattgtttt 


1320 


tcccctctgt 


cagccagtgg 


cctcgataca attttcccag ggctgccttt 


cctttattgc 


1380 


tcaatttaac 


ctcttcctag 


aggttctgca agttcaaggg caggatccgt 


gtcggattca 


1440 


tcaactagtg 


cagcgcctag 


tgcagagcag ggggctcaag aaatacgtgt 


cgaccgcatg 


1500 


attgctaaga 


tttccctgta 


acaggcggtt tttttttctt gtctctctca 


tccctcctgc 


1560 


cctgggtgca 


gaccgcgaac 


agcgccgccg aacttcccaa caggcaggga gagggccgat 


1620 


ccgggctggc 


acgccacgtc 


cccggggtct cagtccctgg gtcaacgcag 


agaggctcgc 


1680 


ggtccgcggt 


gctacgaccc 


gagcccggcc acgtgaagcg tcacgccctg 


cgctgctgca 


1740 


gggcaggcgc 


cgccgccgcg 


gttgtgtaac gctgcagccg gaggggaggc 


gaccggggcg 


1800 


cctctgctcc 


ggagggggca 


gaggagggag ggagcgccgg gcacagcgcc 


cggtatcttt 


1860 


ctgcgcttga 


ctggccggga 


ggaagggggt gacactgggg cacccagggg 


gctgcgcaac 


1920 


cgggtgcccg 


cccgcccgca 


ggttgtgccc aggagcggag ctcctgggag 


cgaggggccc 


1980 


tcggctcaca 


ccctccacgg 


tcaggtgcgc gcggcgtgct cggcggcagc 


ctcagctgcc 


2040 


cctgcgcttg gccgtcggcc 


cgcttcccat ggggtgacat ccgccccgcc 


cctcggtccc 


2100 


tccccaaggc 


gggcaattcc 


tggacgcgag ggtgagcagt gggggcaggg 


aagccaggac 


2160 


ggaaagaaac 


cccagcctct ggggaaggct ggggggccgg gcgactccct 


ccgcagcgcg 


2220 


ccggttcctc 


caagcgggcc 


gggcggggga ggaggaagag ggctgggctg gagctagcaa 


2280 


ggggatattc 


ctctcccggc 


ttgagtcaga cgcgggcgga tccgtcctcc 


cccgttccct 


2340 


cccaggagac 


gggaacttac 


ttcatttccc tggggcaggt tcgcccacgt 


taccaacttc 


2400 


tccccctccc 


ccagcacccc 


cgtcccttcc agcttccgcg ccccccaccc 


aactgggcag 


2460 


gacccaggtc 


gtgctgccac 


cccctcttcg gggaaaggcg gccgcagccg 


cagacacctg 


2520 


ggggccgggg 


ctgggggtgg gggctcccta gcagccgccg gagcgttgtc caacacgtga 


2580 


gactcatgtg 


atgaagccgg 


gggagggcgg gcaggtcgct ccttccctcc 


ccggcagtgg 


2640 


ccagacgtgc 


ctggagtcac 


agggtagaac acgtagctcc aacccaccca 


ccgctccctc 


2700 
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ccgcctcccc 


gcccgcccca 


cttctcattc 


acttggctcg 


cacggcgcag 


acagaccgcg 


2760 


cagggagcac 


acaccgccag 


tctgtgcgct 


gagtcggagc 


cagaggccgc 


ggggacaccg 


2820 


ggccatgcac 


gcccccaact 


gaagctgcat 


ctcaaagccg 


aagattccag 


cagcccaggg 


2880 


gactccaaag 


agctcagact 


cagaggaaca 


tctgcggaga 


gacccccgaa 


gccctctcca 


2940 


gggcagtcct 


catccagacg 


ctccgccagt 


gcagacagga 


gcgcgcagtg 


gccccggctc 


3000 


gccgcgccat 


ggagcggatc 


cccagcgcgc 


aaccaccccc 


cgcctgcctg 


cccaaagcac 


3060 


cgggactgga 


gcacggagac 


ctaccaggga 


tgtaccctgc 


ccacatgtac 


caagtgtaca 


3120 


agtcaagacg 


gggaataaag 


cggagcgagg 


acagcaagga 


gacctacaaa 


ttgccgcacc 


3180 


ggctcatcga 


gaaaaagaga 


cgtgaccgga 


ttaacgagtg 


catcgcccag 


ctgaaggatc 


3240 


tcctacccga 


acatctcaaa 


cttacaacct 


tgggtcactt 


ggaaaaagca 


gtggttcttg 


3300 


aacttacctt 


gaagcatgtg 


aaagcactaa 


caaacctaat 


tgatcagcag 


cagcagaaaa 


3360 


tcattgccct 


gcagagtggt 


ttacaagctg 


gtgagctgtc 


agggagaaat 


gtcgaaacag 


3420 


gccaagagat 


gttctgctca 


ggtttccaga 


catgtgcccg 


ggaggtgctt 


cagtatctgg 


3480 


ccaagcacga 


gaacactcgg 


gacctgaagt 


cttcgcagct 


tgtcacccac 


ctccaccggg 


3540 


tggtctcgga 


gctgctgcag 


ggtggtacct 


ccaggaagcc 


atcagaccca 


gctcccaaag 


3600 


tgatggactt 


caaggaaaaa 


cccagctctc 


cggccaaagg 


ctcggaaggc 


cctgggaaaa 


3660 


actgcgtgcc 


agtcatccag 


cggactttcg 


ctcactcgag 


tggggagcag 


agcggcagcg 


3720 


acacggacac 


agacagtggc 


tatggaggag 


aatcggagaa 


gggcgacttg 


cgcagtgagc 


3780 


agccgtgctt 


caaaagtgac 


cacggacgca 


ggttcacgat 


gggagaaagg 


atcggcgcaa 


3840 


ttaagcaaga 


gtccgaagaa 


ccccccacaa 


aaaagaaccg 


gatgcagctt 


tcggatgatg 


3900 


aaggccattt 


cactagcagt 


gacctgatca 


gctccccgtt 


cctgggccca 


cacccacacc 


3960 


agcctccttt 


ctgcctgccc 


ttctacctga 


tcccaccttc 


agcgactgcc 


tacctgccca 


4020 


tgctggagaa 


gtgctggtat 


cccacctcag 


tgccagtgct 


atacccaggc 


ctcaacgcct 


4080 


ctgccgcagc 


cctctccagc 


ttcatgaacc 


cagacaagat 


ctcggctccc 


ttgctcatgc 


4140 


cccagagact 


cccttctccc 


ttgccagctc 


atccgtccgt 


cgactcttct 


gtcttgctcc 


4200 


aagctctgaa 


gccaatcccc 


cctttaaact 


tagaaaccaa 


agactaaact 


ctctagggga 


4260 


tcctgctgct 


ttgctttcct 


tcctcgctac 


ttcctaaaaa 


gcaacaaaaa 


agtttttgtg 


4320 


aatgctgcaa 


gattgttgca 


ttgtgtatac 


tgagataatc 


tgaggcatgg 


agagcagatt 


4380 


cagggtgtgt 


gtgtgtgtgt 


gtgtgtgtgt 


gtgtatgtgc 


gtgtgcgtgc 


acatgtgtgc 


4440 


ctgcgtgttg 


gtataggact 


ttaaagctcc 


ttttggcata 


gggaagtcac 


gaaggattgc 


4500 
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ttgacatcag 


gagacttggg 


ggggattgta 


aatagccccc 


Ctcgacacac 


atcagctgga 


agtcactctt 


cagttcggga 


gctgggtctg 


gggctttcca 


gggctcagct 


cccaaccagc 


tcgacgtgac 


tcaccagacg 


tcggggagag 


tggggaggta 


gcaggcactg 


gcatiagcacg 


gctccccacc 


cctgcctcgg 


aagaataaag 


gtgattagct 


tactaaggaa 


ctgaaaatgg 


ggagggagga 


gttccctggg 


cttcCggcac 


tgtgcaggga 


accaaaccaa 


gguCugagaoi 


tgcacaaagc 


tgagtaaaaa 


gctgccccct 


catcctaaaa 


ctccttttaa 


ccaagcttag 


cgccagatcc 


tttctgtagg 


ctaattcctc 


gctaaaaagg 


attccgtctc 


cttcaaagaa 


gcgaccacgm 


taagcc 





<210> 41 

<211> 2176 

<212> DNA 

<213> Homo sapien 



<400> 41 



cacgtgctcg 


agctgcttcc 


cgggcagctg 


agcgggcagt 


gtgctggggt 


aaaggacctg 


ccgggatcag 


gtgctgcgaa 


gctgcctgtg 


agcttctgtc 


ggtagtactt 


gtacccttgg 


tggttctcac 


ggttgttctg 


gagggcaatg 


aacctggcca 


actcttctgc 


aaggttcttg 


agcaggctgt 


cagtactggg 


ctcaactcac 


ccttctacct 


gatcccacct 


tcagcgactg 


atcccacctc 


agtgccagtg 


ctatacccag 


gcttcatgaa 


cccagacaag 


atctcggctc 


ccttgccagc 


tcatccgtcc 


gtcgactctt 


cccctttaaa 


cttagaaacc 


aaagactaaa 
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gcagacgtct 


gggcttttcc 


ccacccagag 


4560 


ttttcaaaag 


cttcaaagtc 


ttggtctgtg 


4620 


tggctttgat 


cagaaggtac 


tttcaaaaga 


4680 


tgttaggacc 


ccaccctttt 


gcctttattg 


4740 


agagcagtca 


gaccgagctt 


tctgctaaca 


4800 


gtagtggttt 


ggggaggttt 


ccgcaggtct 


4860 


agaatgtagt 


tccctactca 


ggctttcgta 


4920 


gccccttgta 


caagctgagc 


tgccccggag 


4980 


ctgtttctag 


gcctaaccat 


tagtacttac 


5040 


atgcggacac 


cccgagcgag 


caccccaaag 


5100 


tcaaacagaa 


ctagactcag 


ttttcaattc 


5160 


cttctcaaag 


gcctaaccaa 


gccttggcac 


5220 


ttgcccaacg 


gcatatggag 


tgtccttatt 


5280 


gttttatttt 


tggtccagag 


tacctcggcc 


5340 








5356 


gagcaggatg 


atagtgggcc 


aggcgtgacg 


60 


acaggcctca 


ggagggactt 


gcgcttcagg 


120 


gagccggcct 


tggctttccg 


gaactcctcc 


180 


ctattgggct 


catacagaaa 


gctgaatgcc 


240 


gtttccacct 


cgggaccccc 


gtccgctatg 


300 


acctcttcgt 


cctctggggg 


tgaaacttta 


360 


cacacccaca 


ccagcctcct 


ttctgcctgc 


420 


cctacctgcc 


catgctggag 


aagtgctggt 


480 


gcctcaacgc 


ctctgccgca 


gccctctcta 


540 


ccttgctcat 


gccccagaga 


ctcccttctc 


600 


ctgtcttgct 


ccaagctctg 


aagccaatcc 


660 


ctctctaggg 


gatcctgctg 


ctttgctttc 


720 



wo 2004/050860 PCT/US2003/040063 
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cttcctcgct 


acttcctaaa 


aagcaacaaa 


aaagctcttg 


tgaatgctgc 


aagattgttg 


780 


cattgtgtat 


actgagataa 


tctgaggcat 


ggagagcaga 


ttcagggtgt 


gtgtgtgtgt 


840 


gtgtgtgtgt 


gtgtgtatgt 


gcgtgtgcgt 


gcacatgtgt 


gcctgcgtgt 


tggtatagga 


900 


ctttaaagct 


ccttttggca 


tagggaagtc 


acgaaggatt 


gcttgacatc 


aggagacttg 


960 


9999993 ttg 


tagcagacgt 


ctgggctttt 


ccccacccag 


agaatagccc 


ccttcgatac 


1020 


acatcagctg 


gattttcaaa 


agcctcaaag 


tcttggtctg 


tgagtcactc 


ttcagtttgg 


1080 


gagctgggtc 


tgtggctttg 


atcagaaggt 


actttcaaaa gagggctttc 


cagggctcag 


1140 


ctcccaacca 


gctgttagga 


ccccaccctt 


ttgcctttat 


tgtcgacgtg 


actcaccaga 


1200 


cgtcggggag 


agagagcagt 


cagaccgagc 


tttctgctaa 


catggggagg 


tagcaggcac 


1260 


tggcatagca 


cggtagtggt 


ttggggaggt 


ttccgcaggt 


ctgctcccca 


cccctgcctc 


1320 


ggaagaataa 


agagaatgta 


gttccctact 


caggctttcg 


tagtgattag 


cttactaagg 


1380 


aactgaaaat 


gggccccttg 


tacaagctga 


gctgccccgg 


agggagggag 


gagttccctg 


1440 


ggcttctggc 


acctgtttct 


aggcctaacc 


attagtactt 


actgtgcagg 


gaaccaaacc 


1500 


aaggtctgag 


aaatgcggac 


accccgagcg 


agcaccccaa 


agtgcacaaa 


gctgagtaaa 


1560 


aagctgcccc 


cttcaaacag 


aactagactc 


agttttcaat 


tccatcctaa 


aactcctttt 


1620 


aaccaagctt 


agcttctcaa 


aggcctaacc 


aagccttggc 


accgccagat 


cctttctgta 


1680 


ggccaattcc 


tcttgcccaa 


cggcatatgg 


agtgtcctta 


ttgctaaaaa 


ggattccgtc 


1740 


tccttcaaag 


aagttttatt 


tttggtccag 


agtacctgtt 


ttcccgatgt 


gtccagccag 


1800 


ctccgcagca 


gcttttcaaa 


atgcactatg 


cctgattgct 


gatcgtgttt 


taactttttc 


1860 


ttttcctgtt 


tttattttgg 


tattaagtcg 


ttgcctttat 


ttgtaaagct 


gttataaata 


1920 


tatattatat 


aaatatatta 


aaaaggaaaa 


tgtttcagat 


gtctatttgt 


ataattactt 


1980 






ataaatatat 

Ci L« ^ u C-* ^ k** a ^ 


ccctgtcttt 


gaagagaaga 


ataatttttt 


2040 


ttttctctag 


ggagaggtac 


agtgtttata 


ttttggagcc 


ttcctgaagg 


tgtaaaattg 


2100 


taaatattct 


tatctatgag 


taaatgttaa 


gtagttgttt 


taaaatactt 


aataaaataa 


2160 


ttcttttcct 


gtggaa 










2176 


<210> 42 
<211> 1047 
<212> DNA 
<213> Homo sapien 












<400> 42 
tgatctaagg 


ccaccctctc 


ggggagggag 


ttggggaagc 


tgggttggct 


gggttggtag 


60 


ctcctaccca 


ctgtgtggca 


agaaggtatg 


ggtcatgaac 


agaaccaagg 


agctgcgctg 


120 



wo 2004/050860 PCT/US2003/040063 



180 
240 



600 

660 



54 

ctacagatgt taccacttct gtggctgcta ccccactcct gggccgtccc tgaagctcct 
actccaatgt ggccagatga cctgcaaaac cacacattcc tgcacacagt gcactgccag 

gatgggagtc ccagtgtggg actctctgag gcctacgacg aggaccagct tttcttcttc 300 

gactcttccc agaacactcg ggtgcctcgc ctgcccgaat ttgctgactg ggctcaggaa 360 

cagggagatg ctcctgccat tttatttgac aaagagttct gcgagtggat gatccagcaa 420 

atagggccaa aacttgatgg gaaaatcccg gtgtccagag ggtttcctat cgctgaagtg 480 

ttcacgctga agcccctgga gtttggcaag cccaacactt tggtctgttt tgtcagtaat 54 0 
ctcttcccac ccatgctgac agtgaactgg cagcatcatt ccgtccctgt ggaaggactt 
gggcctactt ttgtctcagc cgtcgatgga ctcagcttcc aggccttttc tcacctaaac 

ttcacaccag aaccttctga cattttctcc tgcattgtga ctcacgaaat tgaccgctac 720 

acagcaattg cctattgggt accccggaac gcactgccct cagatctgct ggagaatgtg 780 

ctgtgtggcg tggcctttgg cctgggtgtg ctgggcatca tcgtgggcat tgctctcatc 840 

atctacttcc ggaagccttg ctcaggtgac tgattcttcc agaccagagt ttgatgccag 900 

cagcttcgga catccctggg gtgtgtgtgt agatttccca cctggggact ctgctgtccc 960 

tgggcttgca tcccagggat cccagagtgg cctgcctatc acaaccacat cccttccccc 1020 

cacaaggcaa taaatctcat ttcttta 1047 

<210> 43 

<211> 871 

<212> DNA 

<213> Homo sapien 

<400> 43 

ccctgatcgc cgatcacctc tgagaccacc ttgctcataa acaaaatgcc catgttggtc 60 

ctctgccctg gacctgtgac attctggact atttctgtgt ttatttgtgg ccgagctcga 120 

gccggccatt ttatttgaca aagagttctg cgagtggatg atccagcaaa tagggccaaa 180 

acttgatggg aaaatcccgg tgtccagagg gtttcctatc gctgaagtgc tcacgctgaa 24 0 

gcccctggag tttggcaagc ccaacacttt ggtctgtttt gtcagtaatc tcttcccacc 300 

catgctgaca gtgaactggc agcatcattc cgtccctgtg gaaggatttg ggcctacttt 360 

tgtctcagct gtcgatggac tcagcttcca ggccttttct tacttaaact tcacaccaga 420 

accttctgac attttctcct gcattgtgac tcacgaaatt gaccgctaca cagcaattgc 480 

ctattgggta ccccggaacg cactgccctc agatctgctg gagaatgtgc tgtgtggcgt 540 

ggcctttggc ctgggtgtgc tgggcatcat cgtgggcatt gttctcatca tctacttccg 600 

gaagccttgc tcaggtgact gattcttcca gaccagagtt tgatgccagc agcttcggcc 660 
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atccaaacag 


aggatgctca 


gatttcccac 


55 

atcctgccca 


ggatctcctc 


ttagggcaga 


720 


agaagtctct 


gggacatccc 


tggggtgtgt 


gtgtagattt 


cccacctggg 


gactctgctg 


780 


tccctgggct 


tgcaccccag 


ggatcccaga 


gtggcctgcc 


tatcacaacc 


acatcccttc 


840 


cccccacaag gcaataaatc 


tcatttcttt 


a 






871 


<210> 44 
<211> 2754 
<212> DNA 
<213> Homo sapien 












< ** U U > ^ "i 

gcgggatggc 


aggggcgagc 


tccacgccct 


gtccccgtcc 


aagctaccac 


ctctactccc 


DU 


accaggctgg 


gaaccagggc 


ttccttggtt 


gtggtcagca 


atgagtctgg 


atgacaattt 


ion 
120 


gtcgggcacg 


agcggtacgg 


aagtggacga 


ccgcgtgtcg 


gcgctggagc 


agcggctgca 


180 


gctacaggaa 


gacgagctgg 


cggtcctaaa 


ggcggcgctg 


gcggatgctc 


tgcgtcgcct 


240 


gcgggcatgc 


gaagaacagg 


gagcggcgcc 


acgcgcgcgg 


ggcaccccca 


2i999ccgggc 


3 00 


gcctccgcgc 


ttaggcacca 


ctgcctcggt 


gtgtcagctc 


ttgaaaggcc 


ttcccaccag 


360 


gacgcccctt 


aatggctcgg 


gacccccgcg 


gcgcgtgggt 


ggctatgcca 


cgtccccatc 


420 


ctctcccaag 


aaggaggcga 


cctccgggcg 


cagcagtgtc 


cgccgctacc 


tgtcaccaga 


4 80 


gcgcctcgcc 


tcggtgcgcc 


gtgaggaccc 


ccgcagccgg 


accacatcct 


ccagcagcaa 


54 0 


ctgtagcgcc 


aaaaaggaag 


gcaaaaccaa 


agaagttatc 


cccagcgtgg 


aggatggctc 


600 


cgtgaaaatg 


ttcctgaggg 


gccgccctgt 


gcccatgacg 


atcccagacg 


agctggcacc 




cacctacagc 


ctggacacac 


gctcggagct 


gccttcttgc 


cggctcaagc 


tggagtgggt 


•7 o 

720 


ctatggctac 


cgtggccgag 


actgccgggc 


caacct ttat 


tcgctgccca 


ccggggagat 


78 0 


agtgtacttc 


gtggcctccg 


tagccgtgct 


atacagcgtg 


gaggagcaga 


ggcagcgaca 


840 


ctacctggga 


cacaacgatg 


acaccaaatg 


cttggccatc 


cacccagata 


tggtcaccat 


900 


cgccacggga 


caggtggcgg 


gaaccactaa 


ggaagggaag 


ccgctgccgc 


cccacgtgcg 


960 


catctgggac 


tcagtttccc 


tctccacctt 


acacgtgctg 


ggcttggggg 


tgtttgacag 


1020 


agccgtgtgc 


tgtgtgggct 


tctccaaatc 


taatggaggc 


aacctgctgc 


gtgcagtgga 


1080 


cgaatccaat 


gatcacatgc 


tctcggtgtg 


ggactgggcc 


aaggagacca 


aggtggtgga 


1140 


tgtcaagtgc 


tccaatgagg 


ctgtattggt 


ggccaccttc 


caccccacgg 


accccactgt 


1200 


gcttatcacc 


tgcgggaaat 


ctcacatcta 


cttctggacc 


ttggaggggg 


gcagcttgag 


1260 


caagcggcaa 


ggcctctttg 


agaaacatga 


gaaaccgaag 


tatgtgctgt 


gtgtgacctt 


1320 


tttggaaggt 


ggcgacgtgg 


tcacggggga 


ctctgggggg 


aacctccatg 


tttggggcaa 


1380 
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aggtgggaac cgcatcacac aggcggcgct gggcgcccac gacggcggcg tgtttgggct 144 0 

ctgcgccctg cgggacggga cgctggtgtc tggagggggc cgtgatcggc gggtggtcct 1500 

ctggggttct gactacagca agctgcagga agtggaggtc cctgaggact ttggccctgt X560 

gcgcaccgtg gcagagggcc acggagacac actgtacgtg gggaccaccc gcaattccat 1620 

cctgcagggc tccgtgcaca caggcttctc actgccggtc cagggccatg tggaagagct 1680 

gtggggcctg gccacacacc ccagtcgggc ccagtttgtg acctgcgggc aggataagct 174 0 

ggtgcatcta tggagctcag attcccacca gcccctgtgg agcaggatca tcgaggaccc 1800 

tgcccgctca gccggcttcc accccagtgg ctctgtcctg gctgtgggta cagtgactgg I860 

cagatggctg ctgctggaca cggagaccca tgacccggtg gctatccaca cagacggcaa 192 0 

tgaacagatc tcagtggtca gcttctcccc agacggggcg tacctggccg tgggctccca 1980 

cgacaacctg gtgtacgtgt acacggtgga ccagggcggc cgcaaggtca gccgcctggg 204 0 

caagtgctcg ggccattcca gctttatcac ccacctggat tgggcccagg acagcagctg 2100 

ctttgtcacc aactccgggg actatgagat tctgtactgg gacccggcta cctgtaagca 2160 

gatcaccagt gcggatgctg tgaggaacat ggaatgggcc acagctactt gtgtcctagg 2220 

gtttggggtg tttgggatct ggtctgaggg ggcggacggc actgatatca acgctgtggc 2280 

ccgctctcat gatgggaagt tgctggcttc agctgatgac tttggcaaag ttcacctgtt 2340 

tagctacccc tgctgtcagc ctcgagccct cagccacaag tacggtggac acagcagcca 2400 

tgtgacaaat gtggccttct tgtgggatga cagcatggcc ctgaccacag ggggcaagga 24 60 

caccagtgtg ctacagtggc gggtggtctg atgcggccag ggaagagtca ggtgtcaggg 2520 

caggaattct attttcggga gatgtctatt gccgagtaga gtaatatata cccagagtat 2580 

gtctatagca gagggggtta tgggggcggg agggtagact gacatacaga agtctctatt 264 0 

tatccgggtg ggaagaggga gtcacatcgc tttggggatc cattggtgtt tggtttgggg 2700 

tgttttttaa gttttttctt ttatatcatc cagaaataaa gacacgtaca ctaa 2754 

<210> 45 
<211> 2309 
<212> DNA 
<213> Homo sapien 

<400> 45 

ctcggccccg ttttcatcgg agacctccgg ggagcggtgg gggtggagga atggtttctc 60 

cccttttctg aactgaatac taagaccctt tttttttctt tgtcctttcc tgacagcaaa 120 

accaaagaag ttatcttcag tgtggaggat ggctccgtga aaatgttcct gaggggccgc 180 

cctgtgccca tgatgatccc agacgagctg gcacccacct acagcctgga cacacgctcg 24 0 
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gagctgcctt 


cttgccggct 


caagctggag 


tgggtctatg gctaccgtgg ccgagactgc 


300 


cgggccaacc 


tttatttgct 


gcccaccggg 


gagatagtgt 


actttgtggc 


ctccgtagcc 


360 


gtgctacaca 


gcgtggagga 


gcagaggcag 


cgacactacc 


tgggacacaa 


cgatgacatc 


420 


aaatgcttgg 


ccatccaccc 


agatatggtc 


accatcgcca 


cgggacaggt 


ggcgggaacc 


480 


actaaggaag 


ggaagccgct 


gccgccccac gtgcgcatct gggactcagt 


ttccctctcc 


540 


accttacacg 


tgctgggctt 


gggggtgttt 


gacagagccg 


tgtgctgtgt 


gggcttctcc 


600 


aaatctaatg 


gaggcaacct 


gctgtgtgca 


gtggatgaat 


ccaatgatca 


catgctctcg 


660 


gtgtgggact 


gggccaagga 


gaccaaggtg 


gtggatgtca 


agtgctccaa 


tgaggctgta 


720 


ttggtggcca 


ccttccaccc 


cacggacccc 


actgtgctta 


tcacctgcgg 


gaaatctcac 


780 


atctacttct 


ggaccttgga 


ggggggcagc 


ttgagcaagc ggcaaggcct 


ctttgagaaa 


840 


catgagaaac 


cgaagtatgc 


gctgtgtgtg 


acctttttgg 


aaggtggcga 


cgtggtcacg 


900 


ggggactctg gggggaacct 


ctatgtttgg ggcaaaggtg ggaaccgtat 


cacacaggcg 


960 


gtgctgggcg 


cccacgacgg 


cggcgtgttt 


gggctctgcg 


ccctgcggga 


cgggacgctg 


1020 


gtgtctggag ggggccgtga 


tcggcgggtg 


gtcctctggg gttctgacta 


cagcaagctg 


1080 


caggaagtgg 


aggtccctga 


ggactttggc 


cctgtgcgca 


ccgcggcaga 


gggccacgga 


1140 


gacacactgt 


acgtggggac 


cacccgcaat 


tccatcctgc 


agggctccgt 


gcacacaggc 


1200 


ttctcactgc 


tggtccaggg 


ccatgtggaa 


gagctgtggg 


gcctggccac 


acaccccagt 


1260 


cgggcccagt 


ttgtgacctg 


cgggcaggat 


aagctggtgc 


atctatggag 


ctcagattcc 


1320 


caccagcccc 


tgtggagcag 


gatcatcgag 


gaccctgccc 


gctcagccgg 


cttccacccc 


1380 


agtggctctg 


tcctggctgt 


gggtacagtg 


actggcagat 


ggctgctgct 


ggacacggag 


1440 


acccatgacc 


tggtggctat 


ccacacagac 


ggcaatgaac 


agatctcagt 


ggtcagcttc 


1500 


tccccagacg gggcgtacct 


ggccgtgggc 


tcccacgaca 


acttggtgta 


cgtgtacacg 


1560 


gtggaccagg 


gcggccgcaa 


ggtcagccgc 


ctgggcaagt 


gctcgggcca 


ttccagtttt 


1620 


atcacccacc 


tggactgggc 


ccaggacagc 


agctgctttg 


tcaccaactc 


cggggactat 


1680 


gagattctgt 


actgggaccc ggctacctgt 


aagcagatca 


ccagtgcgga 


tgctgtgagg 


1740 


aacatggaat gggccacagc tacttgtgtc ctagggtttg gggtgtttgg gatctggtct 


1800 


gagggggcgg 


acggcactga 


tatcaacgct 


gtggcccgct 


ctcatgatgg gaagttgctg 


1860 


gcttcagctg 


atgactttgg 


caaagttcac 


ctgtttagct 


acccctgctg 


tcagcctcga 


1920 


gccctcagcc 


acaagcacgg 


tggacacagc 


agccatgtga 


caaatgtggc 


cttcttgtgg 


1980 


gatgacagca 


tggccctgac 


cacagggggc 


aaggacacca gcgtgctaca 


gtggcgggtg 


2040 


gtctgatgcg gccagggaag 


agtcaggtgt 


cagggcagga 


attctatttt 


cgggagatgt 


2100 
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ctattgccga gtagagtaat atatacccag 
gcgggagggt agactgacat acagaagtct 
atcgcttcgg ggacccattg gtgtttggtt 
tcatccagaa ataaagacac gtacactaa 

<:210> 46 

<211> 2312 

<212> DNA 

<213> Homo sapien 

<400> 46 



ctccaccccc 


Qqccccqaag 


ctccgccacc 


agaccctctc 


gcagaccacc 


ccccatctcc 


aaaaccaaag 


aagttatctt 


cagtgtggag 


cgccctgtgc 


ccatgatgat 


cccagacgag 


tcqqaqctqc 


cttcttgccg 


gctcaagctg 


tqccqqqcca 


acctttattt 


gctgcccacc 


gccgtgctat 


acagcgtqga 


ggaqcagagg 


atcaaatgct 


tggccatcca 


cccagatatg 


accactaagg 


aagggaagcc 


gctgccgccc 


tccaccttac 


acQtqctqqq 


Cttqqqqqtq 


tccaaatcta 


atggaggcaa 


cctgctgtgt 


tcggtgtggg 


actgggccaa 


ggagaccaag 


gtattggtgg 


ccaccttcca 


ccccacggac 


cacatctact 


tctggacctt 


ggaggggggc 


aaacatgaga 


aaccgaagta 


tgtgctgtgt 


acgggggact 


ctggggggaa 


cctctatgtt 


gcggtgctgg 


gcgcccacga 


cggcggcgtg 


ctggtgtctg 


gagggggccg 


tgatcggcgg 


ctgcaggaag 


tggaggtccc 


tgaggacttt 


ggagacacac 


tgtacgtggg 


gaccacccgc 


ggcttctcac 


tgctggtcca 


gggccatgtg 


agtcgggccc 


agtctgtgac 


ctgcgggcag 


tcccaccagc 


ccctgtggag 


caggatcatc 
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agtatgtcta tagcagaggg ggttatgggg 2160 
ctatttatcc gggtgggaag agggagtcac 2220 
tggggtgttt tttaagtttt ttcttttata 2280 

2309 



cgccgccatg agtagctttg gagctgggtg 60 

tctccgcgct cccggggctt agatctcagc 120 

gatggctccg tgaaaatgtt cctgaggggc 180 

ctggcaccca cctacagcct ggacacacgc 24 0 

gagtgggtct atggccaccg tggccgagac 3 00 

ggggagatag tgtactttgt ggcctccgta 360 

cagcgacact acctgggaca caacgatgac 420 

gtcaccatcg ccacgggaca ggtggcggga 4 80 

cacgtgcgca tctgggactc agtttccctc 540 

tttgacagag ccgtgtgctg tgtgggcttc 600 

gcagtggatg aatccaatga tcacatgctc 660 

gtggtggatg tcaagtgctc caatgaggct 720 

cccactgtgc ttatcacctg cgggaaatct 780 

agcttgagca agcggcaagg cctctttgag 840 

gtgacctttt tggaaggtgg cgacgtggtc 900 

tggggcaaag gtgggaaccg tatcacacag 960 

tttgggctct gcgccctgcg ggacgggacg 1020 

gtggtcctct ggggttctga ctacagcaag 1080 

ggccctgtgc gcaccgtggc agagggccac 1140 

aattccatcc tgcagggctc cgtgcacaca 1200 

gaagagctgt ggggcctggc cacacacccc 1260 

gataagctgg tgcatctatg gagctcagat 1320 

gaggaccctg cccgctcagc cggcttccac 1380 
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cccagtggct 


ctgtcctggc 


tgtgggtaca 


gtgactggca 


gatggctgct 


gctggacacg 


1440 


gagacccatg 


acctggtggc 


tatccacaca 


gacggcaatg 


aacagatccc 


agtggtcagc 


1500 


ttctccccag 


acggggcgta 


cctggccgtg 


ggctcccacg 


acaactcggt 


gcacgtgtac 


1560 


acggtggacc 


agggcggccg 


caaggtcagc 


cgcctgggca 


agtgctcggg 


ccattccagt 


1620 


tttatcaccc 


acctggattg 


ggcccaggac 


agcagctgct 


ttgtcaccaa 


ctccggggac 


1680 


tatgagattc 


tgtactggga 


cccggctacc 


tgtaagcaga 


tcaccagcgc 


ggatgctgtg 


1740 


aggaacatgg 


aatgggccac 


agctacttgt 


gtcctagggt 


ttggggtgtt 


tgggatctgg 


1800 


tctgaggggg 


cggacggcac 


tgatatcaac 


gctgtggccc 


gctctcatga 


tgggaagttg 


1860 


ctggcttcag 


ctgatgactt 


tggcaaagtt 


cacctgttta 


gctacccctg 


ctgtcagcct 


1920 


cgagccctca 


gccacaagta 


cggtggacac 


agcagccatg 


tgacaaatgt 


ggccttcttg 


1980 


cgggatgaca 


gcatggccct 


gaccacaggg 


ggcaaggaca 


ccagtgtgct 


acagtggcgg 


2040 


gtggtctgat 


gcggccaggg 


aagagtcagg 


^gc cagggca 


ggaat cctaC 


L uucgggaga 


2100 


tgtctattgc 


cgagcagagt 


aatatatacc 


cagagcatgt 


ctatagcaga 


gggggttatg 


2160 


ggggcgggag 


ggtagactga 


catacagaag 


tctctattta 


tccgggcggg 


aagagggagt 


2220 


cacatcgctc 


tggggatcca 


ttggtgtttg 


gcctggggtg 


ttttctaagt 


tttctctttt 


2280 


atatcatcca 


gaaacaaaga 


cacgtacacc 


aa 






2312 


<210> 47 
<211> 2826 
<212> DNA 
<213> Homo sapien 












<400> 47 
ggccagacgg 


tacctagggg 


gatccgggcg 


cagcaatcct 


ggccctagga 


ttgtgttggg 


60 


gggcttggca 


ggcggtgtcc 


cgacccccct 


ctctgggcat 


cctggggatg 


accccagcgc 


120 


cgggcccggc 


gccggccgcc 


tgatgccagg 


cgagctgagc 


tggggatgct 


ggaacgaagg 


180 


gcgttgctat 


ggcaacggga 


ggcagggcct 


ggttgggggg 


accgggcccg 


ggccgggacc 


240 


gggggggCCg 


gcggtggctg 


tggtggggcg 


atggcggagc 


gcggcccggc 


cttctgcggc 


300 


ctgtacgaca 


cgtcctcgct 


gctgcgatac 


tgcaacgatg 


acaatttgcc gggcacgagc 


360 


ggtatggaag 


tggacgaccg 


cgtgtcggcg 


ctggagcagc ggctgcagtt 


acaggaagac 


420 


gagctggcgg 


tcctaaaggc 


ggcgctggcg 


gatgctctgc gtcgcccgcg ggcatgcgaa 


480 


gaacagggag 


cggcgctacg 


cgcgcggggc 


acccccaagg 


gccgggcgcc 


tccgcgctta 


540 


ggcaccaccg 


cctcggtgtg 


tcagctcttg 


aaaggccttc 


ccaccaggac gccccttaat 


600 


ggctcgggac 


ccccgcggcg 


cgtgggtggc 


tatgccacgt 


ccccatcctc 


tcccaagaag 


660 
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gaggcgacct 


ccgggcgcag 


CagcgCCCgC CgCCaCCCgC caccoyaycy cuuv,.ywt.u^.y 


720 


gtgcgccgtg aggacccccg 


cagccggacc acacccccca gcoywaacty uaywywVi.o»o 


780 






agccaccccc agcgcggagg acgyctccyt. yaeiciauyui.(- 


840 


ctyaygy y 


ywww ^y wywwr 


catgatgatc ccagacgagc tggcacccac ctacagcctg 


900 


gacacacy ci- 


wyy Ay w L.y w w 


ttcttgccgg cccaagctgg agcggg»-cca cggctaccgt. 


960 


rtrt ^ 5> rt A t" 

yy w^yo^o^ ^ 


y w v»y y y Vip V* w 


ccCCtatttg ctgcccaccg gggagatagt gtautttguy 


1020 


y WW w W9>j 


ccgcgctaca 


cagcgtggag gagcagaggc agcgacacca ccc-gggacac 


1080 


doiwyciiuyoiwa 


W wooLO w w w 


MM^M^^at^MM ^^s^/^S^^n^ ^A^^nnA^AQ 

ggccacccac ccagacacgg ccaccatcgc cat^yyycu^ay 


1140 


gtgyuyyyoo 


ccactaagga 


agggaagccg ctgccgcccc awyi.yt.yuau wuyyyawuwc* 


1200 


gtttccctct 


ccaccttaca 


cgtgctgggc ttgggggtgc ctgacagagc cgtgtgctgu 


1260 


gtgggcttct 


ccaaatctaa 


tggaggcaac ctgctgtgtg cagtggatga atccaatgat 


1320 


cacatgccct 


cggtgtggga 


ctgggccaag gagaccaagg tggtggatgt caagtgctcc 


1380 


aatgaggctg 


cattggtggc 


caccttccac cccacggacc ccactgtgct tatcacctgc 


1440 


gggaaatctc 


acatctactt 


ctggaccttg gaggggggca gcttgagcaa gcggcaaggc 


1500 


ctctttgaga 


aacatgagaa accgaagtat gtgctgtgtg tgaccttttt ggaaggtggc 


1560 


gacgtggtca cgggggactc tggggggaac ctctatgttt ggggcaaagg tccctgagga 


1620 


ctttggccct 


gtgcgcaccg 


tggcagaggg ccacggagac acaccgtacg tggggaccac 


1680 


ccgcaattcc atcctgcagg gctccgtgca cacaggcttc tcactgctgg tccagggcca 


1740 


tgtggaagag 


ctgtggggcc 


tggccacaca ccccagtcgg gcccagtttg tgacctgcgg 


1800 


gcaggataag 


ctggtgcatc 


tatggagctc agattcccac cagcccctgt ggagcaggat 


1860 


catcgaggac 


cctgcccgct 


cagccggctt ccaccccagt ggctctgtcc tggctgtggg 


1920 


tacagtgact 


ggcagatggc 


tgctgctgga cacggagacc catgacctgg tggctatcca 


1980 


cacagacggc 


aacgaacaga 


tctcagtggt cagcttctcc ccagacgggg cgtacctggc 


2040 


cgtgggctcc 


cacgacaact tggtgtacgt gtacacggtg gaccagggcg gccgcaaggt 


2100 


cagccgcctg 


ggcaagtgct 


cgggccattc cagttttatc acccacctgg attgggccca 


2160 


ggacagcagc 


tgctttgtca 


ccaaccccgg ggactatgag attctgtact gggacccggc 


2220 


tacctgtaag 


cagatcacca 


gtgcggatgc tgtgaggaac atggaatggg ccacagctac 


2280 


ttgtgtccta 


gggtttgggg 


tgtttgggat ccggtctgag ggggcggacg gcactgatat 


2340 


caacgctgtg gcccgctctc 


atgatgggaa gttgctggct tcagctgatg actttggcaa 


2400 


agctcacctg 


tttagctacc cctgctgtca gcctcgagcc ctcagccaca agtacggtgg 


2460 
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acacagcagc catgtgacaa atgtggcctt 

agggggcaag gacaccagtg tgctacagtg 

caggtgtcag ggcaggaatt ctactttcgg 

tacccagagt atgtctatag cagagggggt 

gaagtctcta tttatccggg tgggaagagg 

tttggtttgg ggtgtttttt aagttttttc 
cactaa 

<210> 48 

<211> 2118 

<212> DNA 

<213> Homo sapien 



<400> 48 



ctccaccccc 


ggccccgaag 


ctccgccacc 


aaccaaagaa 


gttatcttca 


gtgtggagga 


ccctgtgccc 


atgatgatcc 


cagacgagct 


ggagctgcct 


tcttgccggc 


tcaagctgga 


ccgggccaac 


ctttatttgc 


tgcccaccgg 


cgtgctatac 


agcgtggagg 


agcagaggca 


caaatgcttg 


gccatccacc 


cagatatggt 


cactaaggaa 


gggaagccgc 


tgccgcccca 


caccttacac 


gtgctgggct 


tgggggtgtt 


caaatctaat 


ggaggcaacc 


tgctgtgtgc 


ggtgtgggac 


tgggccaagg 


agaccaaggt 


attggtggcc 


accttccacc 


ccacggaccc 


catctacttc 


tggaccttgg 


aggggggcag 


acatgagaaa 


ccgaagtatg 


tgctgtgtgt 


gggggactct 


g99999aacc 


tctatgtttg 


ggtgctgggc 


gcccacgacg 


gcggcgtgtt 


ggtgtctgga 


gggggccgtg 


atcggcgggt 


gcaggaagtg 


gaggtccctg 


aggactttgg 


agacacactg 


tacgtgggga 


ccacccgcaa 


cttctcactg 


ctggtccagg 


accctgcccg 
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cttgtgggat 


gacagcatgg 


ccctgaccac 


2520 


gcgggtggtc 


tgatgcggcc 


agggaagagt 


2580 


gagatgtcta 


ttgccgagta 


gagtaacata 


2640 


tatgggggcg 


ggagggtaga 


ctgacataca 


2700 


gagtcacatc 


gctttgggga 


tccattggtg 


2760 


ttttatatca 


tccagaaata 


aagacacgta 


2820 
2826 


cgccgccatg 


agtagctttg 


gagctggcaa 


60 


tggctccgtg 


aaaatgttcc 


tgaggggccg 


120 


ggcacccacc 


tacagcctgg 


acacacgctc 


180 


gtgggtctat 


ggctaccgtg 


gccgagactg 


240 


ggagatagtg 


tactttgtgg 


cctccgtagc 


300 


gcgacactac 


ctgggacaca 


acgatgacat 


360 


caccatcgcc 


acgggacagg 


tggcgggaac 


420 


cgtgcgcatc 


tgggactcag 


tttccctctc 


480 


tgacagagcc 


gtgtgctgtg 


tgggcttctc 


540 


agtggatgaa 


tccaatgatc 


acatgctctc 


600 


ggtggatgtc 


aagtgctcca 


atgaggctgt 


660 


cactgtgctt 


atcacctgcg 


ggaaatctca 


720 


cttgagcaag 


cggcaaggcc 


tctttgagaa 


780 


gacctttttg 


gaaggtggcg 


acgtggtcac 


840 


gggcaaaggt 


gggaaccgta 


tcacacaggc 


900 


tgggctctgc 


gccctgcggg 


acgggacgct 


960 


ggtcctctgg 


ggttctgact 


acagcaagct 


1020 


ccctgtgcgc 


accgtggcag 


agggccacgg 


1080 


ttccatcctg 


cagggctccg 


tgcacacagg 


1140 


cccagccggc 


ttccacccca 


gtggctctgt 


1200 
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cctggctgtg ggtacagtga ctggcagatg gctgctgctg gacacggaga cccatgacct 1260 

ggtggctatc cacacagacg gcaatgaaca gatctcagtg gccagcttct ccccagacgg 1320 

ggcgtacctg gccgtgggct cccacgacaa cttggtgcac gtgtacacgg tggaccaggg 1380 

cggccgcaag gtcagccgcc tgggcaagtg ctcgggccat tccagtttta tcacccacct 1440 

ggattgggcc caggacagca gctgctttgt caccaactcc ggggactatg agattctgta 1500 

ctgggacccg gctacctgta agcagatcac cagtgcggat gctgtgagga acatggaatg 1560 

ggccacagct acttgtgtcc tagggcttgg ggtgtttggg atctggtctg agggggcgga 1620 

cggcactgat atcaacgccg tggcccgctc tcatgatggg aagttgctgg cttcagctga 1680 

tgactttggc aaagttcacc tgtttagcta cccctgctgt cagcctcgag ccctcagcca 1740 

caagtacggt ggacacagca gccatgtgac aaatgtggcc ttcttgtggg atgacagcat 1800 

ggccctgacc acagggggca aggacaccag tgtgctacag tggcgggtgg tctgatgcgg 1860 

ccagggaaga gtcaggtgtc agggcaggaa ttctatcttc gggagacgtc tattgccgag 1920 

tagagtaata tatacccaga gtatgtctat agcagagggg gttatggggg cgggagggta 1980 

gactgacata cagaagtctc tatttatccg ggtgggaaga gggagtcaca tcgctttggg 2040 

gatccattgg tgtttggttc ggggtgttct ttaagctttt tcttttatat catccagaaa 2100 

taaagacacg tacactaa 2118 

<210> 49 
<211> 2152 
<212> DNA 
<213> Homo sapien 

<400> 49 

ctccaccccc ggccccgaag ctccgccacc cgccgccatg agtagctttg gagctggcaa 60 

aaccaaagaa gttatcttca gtgtggagga tggctccgtg aaaatgttcc tgaggggccg 120 

ccctgtgccc atgatgatcc cagacgagct ggcacccacc tacagcctgg acacacgctc 180 

9gagctgcct tcttgccggc tcaagctgga gtgggtctat ggctaccgtg gccgagactg 240 

ccgggccaac ctttatttgc tgcccaccgg ggagatagtg tactttgtgg cctccgtagc 300 

cgtgctatac agcgtggagg agcagaggca gcgacactac ctgggacaca acgatgacac 360 

caaatgcttg gccatccacc cagatatggt caccatcgcc acgggacagg tggcgggaac 420 

cactaaggaa gggaagccgc tgccgcccca cgtgcgcatc tgggactcag tttccctctc 480 

caccttacac gtgctgggct tgggggtgtt tgacagagcc gtgtgctgtg tgggcttctc 540 

caaatctaat ggaggcaacc tgctgtgtgc agtggatgaa tccaatgatc acatgctctc 600 

ggtgtgggac tgggccaagg agaccaaggt ggtggatgtc aagtgctcca atgaggctgt 660. 
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attggtggcc 


accttccacc 


ccacggaccc 


catctactcc 


tggaccttgg 


aggggggcag 


acatgagaaa 


ccgaagtatg 


cgctgtgtgt 


gggggactct 


ggggggaacc 


tctatgtttg 


ggtgctgggc 


gcccacgacg 


gcggcgtgtt 


ggcgtctgga 


gggggccgtg 


atcggcgggt 


gcaggaagtg 


gaggtccctg 


aggactttgg 


agacacactg 


tacgtgggga 


ccacccgcaa 


cttctcactg 


ctggtccagg 


gccatgtgga 


tcgggcccag 


Cttgtgacct 


gcgggcagga 


ccaccagccc 


ctgtggagca 


ggatcatcga 


cagtggccct 


gtcctggctg 


tgggtacagt 


gacccatgac 


ctggtggcta 


tccacacaga 


ctccccaggg 


ccattccagt 


tttatcaccc 


ttgtcaccaa 


ctccggggac 


tatgagattc 


tcaccagtgc 


ggatgctgtg 


aggaacatgg 


ctggggtgtt 


cgggatctgg 


tctgaggggg 


gctctcatga 


tgggaagttg 


ctggcttcag 


gctacccctg 


ctgtcagcct 


cgagccctca 


tgacaaatgc 


ggccttcttg 


tgggatgaca 


ccagtgtgct 


acagtggcgg 


gtggtctgat 


ggaattctat 


tttcgggaga 


tgtctattgc 


ctatagcaga 


gggggttatg 


ggggcgggag 


tccgggtggg 


aagagggagt 


cacatcgctt 


ttttttaagt 


tttttctttt 


atatcatcca 



<210> 50 

<211> 1899 

<212> DNA 

<213> Homo sapien 

<400> 50 

aggctaggta gcgggctggg tgtgaatcgg 
cactaaggaa gggaagccgc tgccgcccca 
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cactgtgcct 


atcacctgcg 


ggaaatctca 


720 


cttgagcaag 


cggcaaggcc 


tctttgagaa 


780 


gacctttttg 


gaaggtggcg 


acgtggtcac 


840 


gggcaaaggt 


gggaaccgta 


tcacacaggc 


900 


tgggctccgc 


gccctgcggg 


acgggacgct 


960 


ggtcctctgg 


ggttctgact 


acagcaagct 


1020 


ccctgtgcgc 


accgtggcag 


agggccacgg 


1080 


ttccatcctg 


cagggctccg 


tgcacacagg 


1140 


agagctgtgg 


ggcctggcca 


cacaccccag 


1200 


taagctggtg 


catctatgga 


gctcagattc 


1260 


ggaccctgcc 


cgctcagccg 


gcttccaccc 


1320 


gactggcaga 


tggctgctgc 


tggacacgga 


1380 


cggcaatgaa 


cagatctcag 


tggtcagctt 


1440 


acctggattg 


ggcccaggac 


agcagctgct 


1500 


tgtactggga 


cccggctacc 


tgtaagcaga 


1560 


aatgggccac 


agctacttgt 


gtcctagggt 


1620 


cggacggcac 


tgatatcaac 


gctgtggccc 


1680 


ctgatgactt 


tggcaaagtt 


cacctgttta 


1740 


gccacaagta 


cggtggacac 


agcagccatg 


1800 


gcatggccct 


gaccacaggg 


ggcaaggaca 


1860 


gcggccaggg 


aagagtcagg 


tgtcagggca 


1920 


cgagtagagt 


aatatatacc 


cagagtatgt 


1980 


ggtagactga 


catacagaag 


tctctattta 


2040 


tggyga wcca 




gcccggggcg 




gaaataaaga 


cacgtacact 


aa 


2152 


ccgaggtcgc 


ccgggacagg 


tggcgggaac 


60 


cgtgcgcacc 


tgggactcag 


tttccctctc 


120 
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caccttacac 


gtgctgggct 


caaatcttgc 


tccaatgagg 


gcttatcacc 


tgcgggaaat 


caagcggcaa 


ggcctctttg 


tttggaaggt 


ggcgacgtgg 


aggcgggaac 


cgtatcacac 


ctgcgccctg 


cgggacggga 


ctggggttct 


gactacagca 


gcgcaccgtg 


gcagagggcc 


cctgcagggc 


tccgtgcaca 


tttaactcca 


agcacagcag 


cactctgatt 


gggggtggca 


acaccccagt 


cgggcccagt 


ctcagattcc 


caccagcccc 


cttccacccc 


agtggctctg 


ggacacggag 


acccatgacc 


ggtcagcttc 


tccccagacg 


cgtgtacacg 


gtggaccagg 


ttccagtttt 


atcacccacc 


cggggactat 


gagattctgt 


tgctgcgagg 


aacatggaat 


gatctggtct 


gagggggcgg 


gaagttgctg 


gcttcagctg 


tcagcctcga 


gccctcagcc 


cttcttgtgg 


gatgacagca 


gtggcgggtg 


gtctgatgcg 


cgggagatgt 


ctattgccga 


ggttatgggg 


gcgggagggt 


agggagtcac 


atcgctttgg 


ttcttttata 


tcatccagaa 



64 

tgggggtgtt tgacagagcc 
ctgtattggt ggccaccttc 
ctcacatcta cttctggacc 
agaaacatga gaaaccgaag 
tcacggggga ctctgggggg 
aggcggtgct gggcgcccac 
cgctggtgtc tggagggggc 
agctgcagga agtggaggtc 
acggagacac actgtacgtg 
caggcttctc actgctggtc 
aggggcccca ggcccctgct 
cccttcaggg ccatgtggaa 
ttgtgacctg cgggcaggat 
tgtggagcag gatcatcgag 
tcctggctgt gggtacagtg 
tggtggctat ccacacagac 
gggcgtacct ggccgtgggc 
gcggccgcaa ggtcagccgc 
tggattgggc ccaggacagc 
actgggaccc ggctacctgt 
gggccacagc tacttgtgtc 
acggcactga tatcaacgct 
atgactttgg caaagttcac 
acaagtacgg tggacacagc 
tggccctgac cacagggggc 
gccagggaag agtcaggtgt 
gtagagtaat atatacccag 
agactgacat acagaagtct 
ggatccattg gtgtttggtt 
ataaagacac gtacactaa 
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gtgtgctgtg 


tgggcttctc 


180 


caccccacgg 


accccactgt 


240 


ttggaggggg 


gcagcttgag 


300 


tatgtgctgc 


gtgtgacctt 


360 


aacctctatg 


tttggggcaa 


420 


gacggcggcg 


tgtttgggct 


480 


cgtgatcggc 


gggtggtcct 


540 


cctgaggact 


ttggccctgt 


600 


gggaccaccc 


gcaattccat 


660 


caggatccag 


caaccaagag 


720 


ccaactgtgc 


tgcctcctgc 


780 


gagctgtggg 


gcctggccac 


840 


aagctggtgc 


atctatggag 


900 


gaccctgccc 


gctcagccgg 


960 


actggcagat 


ggctgctgct 


1020 


ggcaatgaac 


agatctcagt 


1080 


tcccacgaca 


acttggtgta 


1140 


ctgggcaagt 


gctcgggcca 


1200 


agctgccttg 


tcaccaactc 


1260 


aagcagatca 


ccagtgcgga 


1320 


ctagggtttg 


gggtgtttgg 


1380 


gtggcccgct 


ctcatgatgg 


1440 


ctgtttagct 


acccctgctg 


1500 


agccatgtga 


caaatgtggc 


1560 


aaggacacca 


gtgtgctaca 


1620 


cagggcagga 


attctatttt 


1680 


agtatgtcta 


tagcagaggg 


1740 


ctatttatcc 


gggtgggaag 


1800 


tggggtgttt 


tttaagtttt 


1860 



1899 
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<210> 51 
<211> 2951 
<212> DNA 
<213> Homo sapien 



<400> 51 



Ctcccrc^ss 


cat tgcgsag 


acttcatctg gccaggacac 


tggctgtcca 


cctggcaccg 


60 


9^ccc3ctcag 


aggccgagct 


ggggasagtt 


ratgttcact 


gggggcagga 


acccccct ta 


120 


tctccaggcc 


gcct cccaac 


ctcaggaggc 


cacaagatta 


gcagaaagcc 


acgttgagtc 


18 0 


agccagtaac 


atggagcaac 


tgacaaggga 


aactgaggac 


tattccaaac 


aagccctctc 


240 


actggtgcgc 


aaggcccCgc 


acgaaggagt 


cggaagcgga 


agcggtagcc 


cggacggcgc 


300 


tgtggtgcaa 


gggcttgtgg 


aaaaattgga 


gaaaaccaag 


tccctggccc 


agcagttgac 




aagggaggcc 


actcaagcgg 


aaattgaagc agataggtct 


tatcagcaca 


gtctccgcct 


^ 1 A 


cctggatcca 


gcgccccggc 


tccagggagt 


cagtgatcag 


tcctttcagg 


tggaagaagc 


^ 0 n 


aaagaggatc 


aaacaaaaag 


cggattcact 


ctcaagcctg gtaaccaggc 


atatggatga 


54 U 


gtt caagcgti 


acacagaaga 


atctgggaaa 


ctggaaagaa 


gaagcacagc 


agctct taca 


0 UU 


gaatggciaaa 


cigtgggagag 


agaaatcaga 


tcagctgctt 


tcccgtgcca 


atct tgctaa 


c c t\ 


aagcagagca 


caagaagcac 


tgagtatggg 


caatgccact 


ttttatgaag 


ttgagagcat 


720 


cctcaaaaac 


ctcagagagt 


ttgacctgca 


ggtggacaac agaaaagcag 


aagctgaaga 


*7 0 A 

780 


agccatgaag 


agactctcct 


acatcagcca gaaggtttca gatgccagtg 


acaagaccca 


0 >l A 

84 0 


gcaagcagaa 


agagccctgg 


ggagcgctgc 


tgctgatgca 


cagagggcaa 


agaatggggc 


0 A A 




ctggaaatct 


ccagtgagat 


tgaacaggag attgggagtc 


tgaacttgga 


960 


agccaatgtg 


acagcagatg 


gagccttggc 


catggaaaag ggactggcct 


ctctgaagag 


1020 


Cgagatgagg 


gaagtggaag 


gagagctgga 


aaggaaggag 


ctggagtttg 


acacgaatat 


1080 


ggatgcagta 


cagacggtga 


ttacagaagc 


ccagaaggtt 


gataccagag 


ccaagaacgc 


1140 


tggggttaca 


atccaagaca 


cactcaacac 


attagacggc 


ctcctgcatc 


tgatggacca 


1200 


gcctctcagt 


gtagatgaag 


aggggctggt 


cttaccggag cagaagcttc 


cccgagccaa 


1260 


gacccagatc 


aacagccaac 


tgcggcccat 


gatgtcagag 


ctggaagaga 


gggcacgtca 


1320 


gcagaggggc 


cacctccatt 


tgctggagac aagcatagac 


gggattctgg 


ctgatgtgaa 


1380 


gaactcggag 


aacattaggg 


acaacctgcc 


cccaggctgc 


tacaataccc 


aggctcttga 


1440 


gcaacagtga 


agctgccata 


aatatttctc 


aactgaggtt 


cttgggatac 


agatctcagg 


1500 


gctcgggagc 


catgtcatgt 


gagtgggtgg gatggggaca 


tttgaacatg 


tttaatgggt 


1560 


atgctcaggt 


caactgacct 


gaccccattc 


ctgatcccat 


ggccaggtgg 


ttgtcttatt 


1620 


gcaccatact 


ccttgcttcc 


tgatgctggg 


caatgaggca 


gatagcactg 


9gtgtgagaa 


1680 
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tgatcaagga 


tctggacccc 


aaagaataga 


tgtttgcctc 


ataatagtcg 


taagtggagt 


tagtcaactt 


attctttgag 


taatgtgact 


tgaaattctt 


cctaatgtca 


gaacagagcg 


ctattgcccc 


atattgtcct 


ctgcaagctt 


cccagggtgt 


gaacatgttc 


tccattttca 


ggacctgtaa 


ggcaggccca 


ttcagagcta 


ttctggacct 


gggcatgaca 


tcctttcttt 


tctttattaa 


agcatttcct 


accagcaaag 


tttcaaagtg 


atagaaaagt 


gtggcttggg 


Ctagtcctaa 


tccaatccta 


cttttcgaac 


tcttattttc 


tcaatctcct 


ctctctttcc 


acacttcagc 


tgggtcacat 


ccatccctcc 


tacctccatc 


catccttcca 


acatatattc 


tgggacagtg 


gtgacatagt 


ctctgccctc 


gcatttttaa 


aaaataaatt 


taaacttaca 


caataaccgc 


ttggtttgca 


acctctttgc 


atgggggcac 


ttgagttttg 


gcaaggctga 


attccagctg 


tcactctgtg 


cctttctaca 


aacaccagtg 


ggaattgctg 


gaggaaccag 


ggtgctgcct 


tgcttctgta 


tttccttgga 


aattgttaga 


t 





<210> 52 

<211> 3643 

<212> DNA 

<213> Homo sapien 

<400> 52 

tgccggaaac agaataatgg cgtctcgtag 
gtcgattgga tgaacgcagc tgagattacc 
gtggcgtccc acgcctcgtg cgacagtggg 
tggtggtggt ggtggcgtct tcctgtctca 
gagacaatga gtggtatgat ccaaatgctt 
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ctggatggaa 


agacaaactg 


cacaggcaga 


1740 


cctggaattt 


ggacaagtgc 


tgttgggata 


1800 


aaaggaaaaa 


actctgactt 


tgcccaggca 


1860 


caacccagtc 


acactgtggc 


cagtaaaata 


1920 


cttgctgatc 


agagttcctc 


ctacttacaa 


1980 


agctggaaga 


agcgagcagt 


gttggagtga 


2040 


tggtgcttgc 


tggtgcctgc 


caccttcaag 


2100 


taatgatgcc 


atggcaactt 


agagattgca 


2160 


caaatgttgg 


gaaagtattt 


actttttcgg 


2220 


cattgaaaga 


ggtaaaattc 


tctagattta 


2280 


accaaaaatg 


atgcgcatca 


atgtatttta 


2340 


tccacccata 


ataagagaat 


gttcctactc 


2400 


attcatcctt 


ccatccatct 


ttccatccat 


2460 


attgagtacc 


tactgtgtgc 


caggggctgg 


2520 


atagagttga 


ttgtctagcg 


aggaagacaa 


2580 


aactttgttt 


gtcacaagtg 


gtgtttattg 


2640 


tcaacagaac 


atatgttgca 


agaccctccc 


2700 


cagagctctg 


ggttgtgcac 


atttctttgc 


2760 


actgattgca 


acagactgtt 


gagttatgat 


2820 


aggcacttcc 


accttggctg 


ggaagactat 


2880 


ttttcctgaa 


agtgttttta 


aataaagaac 


2940 








2951 


ccccaggcga 


cagcgtggag 


gggcgggtct 


60 


cccagccact 


aaggacgaag 


aggtggggcg 


120 


cggggctttg 


ttgcctgagt 


aaccgtatga 


180 


acgataccta 


ttttctagtg 


ctgagatcct 


240 


cacttcttcg 


acgttatgag 


ctttcatttt 


300 
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acccagggga 


tggatctgtt 


gaaatgcatg 


atgtaaagaa 


tcatcgcacc 


tttttaaagc 


360 


ggaccaaata 


tgataacctg 


cacttggaag 


atttacttat 


aggcaacaaa 


gcgaatgtct 


420 


tttctcgaca 


actggtatta 


attgactatg 


gggatcaata 


tacagctcgc 


cagctgggca 


480 


gtaggaaaga 


aaaaacgcta 


gccctaatta 


aaccagatgc 


aatatcaaag 


gctggagaaa 


540 


taattgaaat 


aataaacaaa 


gctggattta 


ctataaccaa 


actcaaaatg 


atgatgcttt 


600 


caaggaaaga 


agcattggat 


tttcatgtag 


atcaccagtc 


aagacccttt 


ttcaatgagc 


660 


tgatccagtt 


tattacaact 


ggccctatca 


ttgccatgga 


gattttaaga 


gatgatgcta 


720 


tatgtgaatg 


gaaaagactg 


ctgggacctg 


caaactctgg 


agtggcacgc 


acagatgctt 


780 


ctgaaagcat 


tagagccctc 


tttggaacag 


atggcataag 


aaatgcagcg 


catggccccg 


840 


attcttttgc 


ctctgcggcc 


agagaaatgg 


agttgttttt 


tccttcaagt 


ggaggtcgtg 


900 


ggccggcaaa 


cactgctaaa 


tttactaatt 


gtacctgttg 


cattgttaaa 


ccccatgctg 


960 


tcagtgaagg 


actgttggga 


aagatcctga 


tggctatccg 


agatgcaggt 


tttgaaatct 


1020 


cagccatgca 


gacgttcaat 


atggatcggg 


ttaatgttga 


ggaattctat 


gaagtttata 


1080 


aaggagtagt 


gaccgaatat 


catgacatgg 


tgacagaaat 


gtattctggc 


ccttgtgtag 


1140 


caatggagat 


tcaacagaat 


aatgctacaa 


agacatttcg 


agaattttgc 


ggacctgctg 


1200 


atcctgaaat 


tgcccggcat 


ttacgccctg 


gaactctcag 


agcaatcttt 


ggtaaaacta 


1260 


agatccagaa 


tgctgttcac 


cgtactgatc 


tgccagagga 


tggcctatta 


gaggttcaat 


1320 


acttcttcaa 


gatcttggat 


aattagtggt 


gtggaaagta 


aagaagtcac 


aggttgggac 


1380 


atttagacaa 


gagtgaatca 


cacacgagga 


atgcgttcat 


tcttttattg 


tccgttgttt 


1440 


taacctgact 


gaatacaaga 


tcaacaagag 


caccgtactc 


ctggcaatta 


ttacatatgt 


1500 


tagaacatgg 


attttgcact 


gtagacaaca 


tttaacacca 


gtctatgggg 


tactgcattg 


1560 


ctttttataa 


agttcaaaat 


aaagatttat 


tttcaaacaa 


gtgactttgg 


tttatttcat 


1620 


acatCacact 


ttttcttgca 


gaaaaaaata 


aaattgtaca 


actgcataaa 


tataaaattc 


1680 


ttccaccatg 


aaaatggtta 


aaacattcat 


aaagacacag 


caccagcatg 


tgatgcctcc 


1740 


agaaaggaaa 


aaaggaaagt 


aaaagaaaat 


gtataaagca 


aattaattag 


ccctgtcact: 


1800 


agccaaagtg 


atgggcagat 


ccagatctca 


gacacagctt 


tagaacatac 


cggaggacaa 


1860 


ggcaagagat 


gtcaagacag 


cattcaatac 


aagcataaca 


caacccatca 


cctttttgtg 


1920 


tttctctgta 


gtgtcactta 


aaatttaaag 


ggcagttccc 


ccatgacaaa 


cccccacccc 


1980 


agcccccagg 


aaaaaaataa 


aaataaagac 


ccaacaccac 


aggaaaagac 


tatggagtat 


2040 


gctaaaaatg 


ctcatcgatg 


ggccattacc 


tttgaaagag 


ccaaaaaaca 


aaaaaaaaaa 


2100 
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aaaaaaaaaa attaccatgc cagttttatt 
accttgctca cataaagtag aaaacagata 
gtatgcacct gtagtactgt acactaaata 
agaaacactg cttactacaa gtcagttacg 
tcccaaggct caatttttct ttttcttttg 
tattgctaca ctttaattta cattctaacc 
atagattaag tgtaggtccc atacgtatga 
tttgtatctt tttttaacct attaaatggc 
tcttaacttc aatttttaca tcaaaacgtc 
tgttctcacc gtacgcctta cactccatgc 
actgtacggc cagcaggggc tgcaggtatt 
aatactgcag aggaaaacca ggggagctgc 
tatccttatc ttcatctcgc ctgccagtgc 
tcatcactgg gtaagtctcc aaggactcat 
ctcggttgag ctttataata atgcacggtt 
ttaatccaga gcaattcccc agccattcaa 
gttcatgatt aaagtctcct cgttctttcg 
aaatcatgtc atccctctgg gctgaatctt 
gtacatatgc ctcatagctc ttgggatcat 
tctgaggaat ctgtgttaat cctggcgggg 
cactgatggt gagcagcatc acttggatgg 
aaaatattac gtagaataga aggatcctaa 
ccttcttctc tgagttccag atgaatttct 
gggccatggc gatggcgggt cagcagctgc 
gcgccctggc gtcctctgcg ccgctcggct 
cgccgctgcc gtcgkctctc ggagtgcgag 

<210> 53 

<211> 1005 

<212> DNA 

<213> Homo sapien 

<400> 53 

ggccgctcgc tgacgccatg tttcctcttt 
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cccgttgaat atttacacct tggacagcaa 2160 

caataaaaca tggcttgaaa aatgaccaga .2220 

aaatacacaa ggcagcaata cttaggggcc 2280 

gaatcataat ttacagtaaa aatgggcacg 2340 

tcatttacag tagaataaat attttgttgc 2400 

cattaaatgc agaaagctag tgtaaagcat 2460 

cagtttgttc aagaccagta ggtttttgtt 2520 

tagtgggaaa gattcgtgct tgtgatcagc 2580 

cctgaaaacg gtctttctca ctgtacccaa 2640 

gaatttcagt gtccatggta agattggtga 2700 

tgggctgcag gagtttgcca tagtacggat 2760 

ccagtccaaa atactccaca tttccaactt 2820 

actgaacggg aaggacattt gggttatact 2880 

tcttgggagg cttaggtttg aagcctagaa 2940 

tgccctcttt gtagccataa gtttcatcat 3000 

gcttgaatct gcagaccttt cgctctcctc 3060 

gctcactggg cacatcgcca caatcttcaa 3120 

tgtacttttc caggaaccta actatgttca 3180 

taggacgaaa ggaaatttca gtcttctgga 3240 

ccactcggtc ctgatatgtg ggcttaaatt 3300 

ttccgatgaa gatgccagcc aggcagccat 3360 

accaactgcc accggtcctg cccagaaact 3420 

tccagctgcc ctcctccttg gctttcccgc 3480 

cgcggtccgg agggtgggtg gctgcggcgc 354 0 

tctccggcct ggctctctgc aggcaggacg 3600 

acgcgcggcg gag 3643 



cggccgcgcc ggtgaacagg acccgtcgcc 60 
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atgggccgtg tgatccgtgg acagaggaag ggcgccgggt ctgtgttccg cgcgcacgtg 120 

aagcaccgta aaggcgctgc gcgcctgcgc gccgtggatt tcgctgagcg gcacggctac 180 

atcaagggca tcgtcaagga catcatccac gacccgggcc gcggcgcgcc cctcgccaag 24 0 

gtggtcttcc gggatccgta tcggtttaag aagcggacgg agctgttcat tgccgccgag 300 

ggcattcaca cgggccagtt tgtgtattgc ggcaagaagg cccagctcaa cattggcaat 360 

gtgctccctg tgggcaccat gcctgagggt acaatcgtgt gctgcctgga ggagaagcct 420 

ggagaccgtg gcaagctggc ccgggcatca gggaactatg ccaccgttat ctcccacaac 480 

cctgagacca agaagacccg tgtgaagctg ccctccggct ccaagaaggt tatctcctca 54 0 

gccaacagag ctgtggttgg tgtggtggct ggaggtggcc gaattgacaa acccatcttg 600 

aaggctggcc gggcgtacca caaatataag gcaaagagga actgctggcc acgagtacgg 660 

99tgtggcca tgaatcctgt ggagcatcct tttggaggtg gcaaccacca gcacatcggc 720 

aagccctcca ccatccgcag agatgcccct gctggccgca aagtgggtct cattgctgcc 780 

cgccggactg gacgtctccg gggaaccaag actgtgcagg agaayatgtg ggckcmaagt 84 0 

gggtttgccg aaaaaaacac aacaacgcaa acacaaaggc aaacatatag aaaaaaaggg 900 

ggctaccggg gccgacactc tagaggtaat ataatagcgg cagaggacag gggagggtag 96 0 

ggcaacactt tattccgtat atttggccgc tctaaaatca cactc 1005 

<210> 54 
<211> 3273 
<212> DNA 
<213> Homo sapien 

<220> 

<221> misc_feature 
<222> (3246) . . (3246) 
<223> n=a, c, g or t 

<400> 54 

gccatggtaa gcgcggacgc catggtaagc gcggacgcca tggtatggta agcgcggacg 60 

ccatggtaag cgcggacgcc atggtaagcg cggacgccat ggtaagcgcg gacgccatgg 120 

taagcgcgga cgccatggta agcgcggacg ccatggtaag cgcggacgcc atggtaagcg 180 

cggacgccat ggtaagcgcg gacgccatgc acacggaccc tgactactcg gctgcctatg 240 

tcgtcataga aactgatgca gaagatggaa tcaaggggtg tggaattacc ttcactctgg 300 

gaaaaggcac tgaagttgtt gtctgtgctg tgaatgccct cgcccaccat gtgctcaaca 360 

aggacctcaa ggacattgtt ggtgacttca gaggcttcta taggcagctc acaagtgatg 420 

ggcagctcag atggattggt ccagaaaagg gcgtggtgca cctggcgaca gcggccgtcc 480 
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taaacgcggt 


gtgggacttg 


tgggccaagc 


tggacatgga 


tcccaggatg 


ctggtatcct 


tgactgagga 


ggatgcccta 


gaaatactgc 


agaagcaaat 


gctggcacaa 


ggataccctg 


acccagatga 


cacgttgaag 


cagctctgtg 


ttaaagtaaa 


ggtgggtgct 


gatctccagg 


acatgattgg 


accggaaaag 


actttgatga 


aggcggtgga 


gtggatgtcc 


aagctggcca 


cctcccctga 


tgacattctg 


gggcacgcca 


ttggcattgc 


cacaggagaa 


cagtgccaca 


cgaaggccct 


gcagttcctc 


cagattgaca 


tctcagtatc 


gctgatggcc 


aaaaagtttg 


ttggcctctg 


tgaactggtg 


cagcacctga 


gccttgaaaa 


tagggtgtgt 


gagtatgctg 


tgatgatcca 


gcgggcttcc 


tacatgcctc 


aggaggaatc 


tgtaaagaaa 


caccagtatc 


ctgctcaaga 


aaattaagtg 


ctcagcccca 


ctaaaatttc 


ttggaaatag 


ttttacaaaa 


gagttcagct 


agaagtcata 


ccaccctcag 


tcggcatcca 


gcccaacccc 


taaagactga 


ctaaacagcc 


acttccattt 


taatagttgg 


ttcagctttg 


aagtcatcaa 


ttttctcaac 


tctgggttct 


cgctgaagct 


gaattttcag 


atctcccaaa 


gtgtgtataa 


agtcacctgg 


atgttagttt 


caattcttta 


aaacatcatc 


ggcttttttt 


aaaaaaataa 


tttagtgtag 


tat* ta tatta 




tattgaactc 


ctcccaaagt 


gctgggatca 


caggcatgag 


agaaaatcca 


tagaagctgt 


atcacaaaca 


cagggccaga 


aaaccttcca 


gaagaggaag 


cttacacata 


tcaaatttaa 


aagctaacca 
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aggagggaaa 


gcctgtctgg 


aagttacttg 


540 


gcatagattt 


caggtacatc 


actgatgtcc 


600 


agaaaggtca 


aattggtaaa 


aaagaaagag 


660 


cttacacgac 


atcgtgcgcc 


tggctggggt 


720 


cccaggcgct 


gaaggatggc 


tggaccaggt 


780 


atgacatgcg 


aagatgccaa 


atcatccgag 


840 


tggatgccaa 


ccagcgctgg 


gatgtgcctg 


900 


agttcaagcc 


attgtggatt 


gaggagccaa 


960 


ccatttccaa 


ggcactggtc 


ccattaggaa 


1020 


atagagcgat 


atttaagcaa 


ctcctacagg 


1080 


gttgcagact 


gggcagtgtc 


aatgagaacc 


1140 


aaattcctgt 


ttgcccccat 


gctggtggag 


1200 


ttatatttga 


ctacatatca 


gtttctgcaa 


1260 


accacctgca 


tgagcatttc 


aagtatcccg 


1320 


ccaaggatcc 


cggctactca 


acagaaatga 


1380 


cagatggtga 


agtttggaag 


aaactcctcc 


1440 


acaacttttt 


tctttctgaa 


gtgaaagggc 


1500 


atggatttaa 


aaaatcctac 


cgaccaagat 


1560 


gaatcagcta 


aagcaaaaag 


aacttttacc 


1620 


caatatcctt 


cgagctcctt 


tgaaagcacc 


1680 


atgcggattg 


tacccttcaa 


tctgaaagtc 


1740 


tttccgaaga 


atcctgagct 


ttgggaaagg 


1800 


tggctcgatg 


tgattcaggt 


aaatatgtgc 


1860 


ctataaaccc 


gggggagaaa 


gcagaacagt 


1920 


taaaaacatt 


agaatatgca 


gacaccgcaa 


1980 


ctcttccatt 


tttttgtagc 


aacagcatct 


2040 


cagacctcaa 


gcaattgctc 


ctgtctcagc 


2100 


ccaccatacc 


caacctcagc 


atagcttttg 


2160 


acctgtatag 


atctgttagt 


gcgtatacca 


2220 


gtttcaaagt 


aaaagctggt 


tcatctctta 


2280 


gagactaaac 


tctgcaattt 


gttttcccat 


2340 
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attaaagaac 


tgaagagctc 


agtgtggtag 


gctggcaagt 


cacccttccc 


gagacagccc 


2400 


accttcaggc 


ccgtgatgtg 


cgcaatcatg 


tacgtgagca gggcgtagct 


ggcgatgttg 


2460 


aaaggcacac 


cgaggcccat 


gtctcccgat 


ctctggtaca 


gctggcagga 


cagctcactg 


2520 


ttcaccacat 


agaactggca 


gagggcatgg 


catggaggca gcgccatcag 


aggaagatct 


2580 


cttggattcc 


aagcgcacat 


gatgattctt 


ctgtcgccag 


ggttggtttt 


gatggtgtca 


2640 


atcacccttt 


gcagctggtc 


aactccctgt 


cctgaataat 


ctgattccat 


atctctgtat 


2700 


tctgccccaa 


aatgcctcca 


ctggaagcca 


taaactgggc 


ccaagtcccc 


ttcttctctg 


2760 


gtggagaatc 


ccaggctgtc 


caaaaagtct 


cgggatccat 


tggcatccca 


gattttcact 


2820 


cccttggaag 


acagctcttt 


agcatttgtg 


gatcccttga 


taaaccacag 


caactcctcc 


2880 


aaaacaccct 


tccagaacac 


acgtttggtt 


gtcagcagag 


ggaattcatc 


tctcaggctg 


2940 


tagcgcgcct 


gcatgccgaa 


taccgacagg 


gtgccggtgc 


ccgtgcggtc 


gtccttcctg 


3000 


acgccgcagc 


ggaggatgcg 


ttggatctgc 


cccaggtact 


gcagctcccc 


gtgcggcgga 


3060 


cgcggcccgg 


cgtcccgccc 


ctgtgcggcg 




gccggcgcgg 


cagctccgag 


3120 


ccggccacag 


gcatggcgcg 


gcgggcgggg 


qacggaggca 

9 9 9 99 


qgcgaagtgg 

99 9 9 ^9 


cgcggcggga 


3180 


cggaggcagg 


ccaagtggcg 


cggcgggacg 


qaqqcaqacc 


aagtggcgcg 


gtggcaggac 


3240 


cctcgnccga 


tcaaccgagc 


ctcgtactag 


cca 








<210> 55 
<211> 2967 
<212> DNA 
<213> Homo sapien 












<400> 55 
gccatggtaa 


gcgcggacgc 


catggtaagc 


gcggacgcca 


tggtatggta 


agcgcggacg 


60 


ccatggtaag 


cgcggacgcc 


atggtaagcg 


cggacgccat 


ggtaagcgcg 


gacgccatgg 


120 


taagcgcgga 


cgccatggta 


agcgcggacg 


ccatggtaag 


cgcggacgcc 


atggtaagcg 


180 


cggacgccat 


ggtaagcgcg 


gacgccatgc 


acacggaccc 


tgactactcg gctgcctatg 


240 


tcgtcataga 


aactgatgca 


gaagatggaa 


tcaaggggtg 


tggaattacc 


ttcactctgg 


300 


gaaaaggcac 


tgaagttgtt 


gtctgtgctg 


tgaatgccqt 


cgcccaccat 


gtgctcaaca 


360 


aggaccccaa 


ggacattgtt 


ggtgacttca 


gaggcttcta 


taggcagctc 


acaagtgatg 


420 


ggcagctcag 


atggattggt 


ccagaaaagg 


gcgtggtgca 


cctggcgaca 


gcggccgtcc 


480 


taaacgcggt 


gtgggacttg 


tgggccaagc 


aggagggaaa 


gcctgtctgg 


aagttacttg 


540 


tggacatgga 


tcccaggatg 


ctggtatcct 


gcatagattt 


caggtacatc 


actgatgtcc 


600 


tgactgagga 


ggatgcccta 


gaaacaccgc 


agaaaggtca 


aattggtaaa 


aaagaaagag 


660 
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agaagcaaat 


gctggcacaa 


ggataccctg 


cttacacgac 


atcgtgcgcc 


tggctggggt 


720 


actcagatga 


cacgttgaag 


cagctctgtg 


cccaggcgct 


gaaggatggc 


tggaccaggt 


780 


ttaaagcaaa 


ggtgggtgct 


gacctccagg 


atgacatgcg 


aagatgccaa 


atcatccgag 


840 


acatgactgg 


accggaaaag 


actttgatga 


tggatgccaa 


ccagcgctgg 


gatgtgcctg 


900 


aggcggtgga 


gtggatgtcc 


aagctggcca 


agttcaagcc 


attgtggatt 


gaggagccaa 


960 


cctcccctga 


tgacattctg 


gggcacgcca 


ccatttccaa 


ggcaccggtc 


ccattaggaa 


1020 


ttggcattgc 


cacaggagaa 


cagtgccaca 


atagagtgat 


atttaagcaa 


ctcctacagg 


1080 


cgaaggccct 


gcagttcctc 


cagattgaca 


gttgcagact 


gggcagtgtc 


aatgagaacc 


1140 


tctcagcatt 


gctgatggcc 


aaaaagcttg 


aaattcctgt 


ttgcccccat 


gctggtggag 


1200 


ttggcctctg 


tgaactggtg 


cagcacctga 


ttatatttga 


ctacatatca 


gcttctgcaa 


1260 


gccttgaaaa 


tagggtgtgt 


gagtatgttg 


accacctgcd 


tgagcatttc 


aagtatcccg 


1320 


tgatgatcca 


gcgggcttcc 


tacatgcctc 


ccaaggatcc 


cggctactca 


acagaaatga 


1380 


aggaggaatc 


tgtaaagaaa 


caccagtatc 


cagatggtga 


agtttggaag 


aaactccttc 


1440 


ctgctcaaga 


aaattaagtg 


ctcagcccca 


acaactcttt 


tctttctgaa 


gtgaaagggc 


1500 


ttaaaatttc 


ttggaaatag 


ttttacaaaa 


atggatttaa 


aaaatcctac 


cgatcaagat 


1560 


gagttcagct 


agaagtcata 


ccaccctcag 


gaaccagcta 


aagcaaaaag 


aacttttacc 


1620 


tcggcatcca 


gcccaacccc 


taaagactga 


caatatcctt 


cgagctcctt 


tgaaagcacc 


1680 


ccaaacagcc 


atttccattt 


taatagttgg 


atgcggattg 


tacccttcaa 


tctgaaagtc 


1740 


ttcagctttg 


aagtcatcaa 


ttttctcaac 


ttttcgaaga 


atcctgagct 


ttgggaaagg 


1800 


tctgggttct 


cgctgaagct 


aaaaacaaaa 


taaggccatt 


attttgccat 


aattgtacga 


1860 


cctgttgtaa 


ttgctcctca 


tgtccgtgaa 


acaagtacac 


aggatgtgat 


caacaaagtt 


1920 


ctattttaca 


ggagtatgat 


cctgtcgata 


ccttgccgta 


ggttatgtaa 


catgattgga 


1980 


gcgcaaccag 


ctgttctctt 


gcacagatcg 


agagtgaggg 


gtattttgtg 


acattacaca 


2040 


gcatcaggag 


cctggtgcct 


catcaggtaa 


ctgaagagct 


cagtgtggta 


ggctggcaag 


2100 


tcacccttcc 


cgagacagcc 


caccttcagg 


cccgtgatgt 


gcgcaatcat 


gtacgtgagc 


2160 


agggcgtagc 


tggcgatgtt 


gaaaggcaca 


ccgaggccca 


tgtctcccga 


tctctggtac 


2220 


agctggcagg 


acagctcact 


gttcaccaca 


tagaactggc 


agagggcatg 


gcatggaggc 


2280 


agcgccatca 


gaggaagatc 


tcttggattc 


caagcgcaca 


tgatgattcc 


tctgtcgtca 


2340 


gggttggttt 


tgatggtgtc 


aatcactctt 


tgcagttggt 


caactccctg 


tcctgaataa 


2400 


tctgattcca 


tatctctgta 


ttccgcccca 


aaatgcctcc 


actggaagcc 


ataaactggg 


2460 
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cccaagtccc 


cttcttctct 


ggtggagaat 


cccaggctgt 


ccaaaaagtc 


tcgggatcca 


2520 


ttggcatccc 


agattttcac 


tcccttggaa 


gacagctctt 


tagcatctgt 


ggatcccttg 


2580 


ataaaccaca 


gcaactcctc 


caaaacaccc 


ttccagaaca 


cacgtttggt 


tgtcagcaga 


2640 


gggaattcat 


ctctcaggct 


gtagcgcgcc 


tgcatgccga 


ataccgacag ggtgccggtg 


2700 


cccgtgcggt 


cgtcctccct 


gacgccgcag 


cggaggatgt 


g.ttggatctg 


ccccaggtac 


2760 










cctgtgcggc 


ggggggcaag 


2820 


ggccggcgcg 


gcagctccga 


gccggccaca 


ggcatggcgc 


ggcgggcggg 


ggacggaggc 


2880 


aggcgaagtg 


gcgcggcggg 


acggaggcag 


gccaagtggc 


gcggggcagg 


accccttccg 


2940 


cgcgcctttt 


cccgcggccg 


tcccgcg 








2967 


<210> 56 
<211> 2761 
<212> DNA 
<213> Homo sapien 












<220> 

<221> misc_feature 
<222> (2734) . . (2734) 
<223> n«a, c, g or t 












<400> 56 
gccacggtaa 


gcgcggacgc 


catggtaagc 


gcggacgcca 


tggtatggta 


agcgcggacg 


60 


ccacggtaag 


cgcggacgcc 


atggtaagcg 


cggacgccat 


ggtaagcgcg 


gacgccatgg 


120 


taagcgcgga 


cgccatggta 


agcgcggacg 


ccatggtaag 


cgcggacgcc 


atggtaagcg 


180 


cggacgccat 


ggtaagcgcg 


gacgccatgc 


acacggaccc 


tgactactcg gctgcctatg 


240 


tcgtcataga 


aactgatgca 


gaagatggaa 


tcaaggggtg 


tggaattacc 


ttcactctgg 


300 


gaaaaggcac 


tgaagttgtt 


gtctgtgctg 


tgaatgccct 


cgcccaccat 


gtgctcaaca 


360 


aggacctcaa 


ggacattgtt 


ggtgacttca gaggcttcta 


taggcagctc 


acaagtgatg 


420 


ggcagctcag 


atggattggt 


ccagaaaagg gcgtggtgca 


cctggcgaca gcggccgtcc 


480 


taaacgcggt 


gtgggacttg 


tgggccaagc 


aggagggaaa 


gcctgtctgg 


aagctacttg 


540 


tggacatgga 


tcccaggatg 


ctggtatcct 


gcatagattt 


caggtacatc 


actgatgtcc 


600 


tgactgagga 


ggatgcccta 


gaaatactgc 


agaaaggcca 


aattggtaaa 


aaagaaagag 


660 


agaagcaaat 


gctggcacaa 


ggacaccctg cttacacgac atcgtgcgcc 


tggctggggt 


720 


actcagatga 


cacgttgaag 


cagctctgtg 


cccaggcgct 


gaaggatggc 


tggaccaggt 


780 


ttaaagtaaa 


ggtgggtgct 


gatctccagg 


atgacatgcg 


aagatgccaa 


atcatccgag 


840 


acatgattgg 


accggaaaag 


actttgatga tggatgccaa ccagcgctgg gatgtgcctg 


900 
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aggcggtgga 


gtggatgtcc 


aagctggcca 


agttcaagcc 


attgtggatt 


gaggagccaa 


960 


cctcccctga 


tgacattctg 


gggcacgcca 


ccacttccaa 


ggcactggtc 


ccattaggaa 


1020 


ttggcattgc 


cacaggagaa 


cagtgccaca 


atagagtgat 


atttaagcaa 


ctcctacagg 


1080 


cgaaggccct 


gcagttcctc 


cagattgaca 


gttgcagact 


gggcagtgtc 


aatgagaacc 


1140 


tctcagtatt 


gctgatggcc 


aaaaagtttg 


aaattcctgc 


ttgcccccat 


gctggtggag 


1200 


ttggcctctg 


tgaactggtg 


cagcacctga 


Ctatatttga 


ctacatatca 


gtttctgcaa 


1260 


gccttgaaaa 


tagggtgtgt 


gagtatgttg 


accacctgca 


tgagcatttc 


aagtatcccg 


1320 


tgatgatcca 


gcgggcttcc 


tacatgcctc 


ccaaggatcc 


cggctactca 


acagaaatga 


1380 


aggaggaacc 


tgtaaagaaa 


caccagtatc 


cagatggtga 


agtttggaag 


aaactccttc 


1440 


ctgctcaaga 


aaattaagtg 


ctcagcccca 


acaacttttt 


tcttcctgaa 


gtgaaagggc 


1500 


ttaaaatttc 


ttggaaatag 


ttttacaaaa 


atggatttaa 


aaaatcctac 


cgatcaagat 


1560 


gagtccagct 


agaagtcata 


ccaccctcag 


gaatcagcta 


aagcaaaaag 


aacttttacc 


1620 


tcggcatcca 


gcccaacccc 


taaagactga 


caatatcctt 


cgagctcctt 


tgaaagcacc 


1680 


ctaaacagcc 


atttccattt 


taatagttgg 


acgcggattg 


tacccttcaa 


tctgaaagtc 


1740 


ttcagctttg 


aagtcatcaa 


ttttctcaac 


ttttcgaaga 


atcctgagct 


ttgggaaagg 


1800 


tctgggttct 


cgctgaagct 


gaattttcag 


tggctcgatg 


tgattcaggt 


aaatatgtgc 


1860 


acctcccaaa 


gtgtgtataa 


agtcacctgg 


cttcaggccc 


gtgatgtgcg 


caatcatgta 


1920 


cgtgagcagg 


gcgtagctgg 


cgatgttgaa 


aggcacaccg 


aggcccatgt 


ctcccgatct 


1980 


ctggtacagc 


tggcaggaca 


gctcactgtt 


caccacatag 


aactggcaga 


gggcatggca 


2040 


tggaggcagc 


gccatcagag 


gaagatctct 


tggattccaa 


gcgcacatga 


tgattcttct 


2100 


gtcgtcaggg 


ttggttttga 


tggtgtcaat 


cactctttgc 


agttggtcaa 


ctccctgtcc 


2160 


tgaataatct 


gattccatat 


ctctgtattc 


tgccccaaaa 


tgcctccact 


ggaagccata 


2220 


aactgggccc 


aagtcccctt 


cttctctggt 


ggagaatccc 


aggctgtcca 


aaaagtctcg 


2280 


ggatccattg 


gcatcccaga 


ttttcactcc 


cttggaagac 


agctctttag 


catttgtgga 


2340 


tcccttgata 


aaccacagca 


actcctccaa 


aacacccttc 


cagaacacac 


gtttggttgt 


2400 


cagcagaggg 


aattcatctc 


tcaggctgta 


gcgcgcctgc 


atgccgaaca 


ccgacagggt 


2460 


gccggcgccc 


gtgcggtcgt 


cccLcccgac 


gccgcagcgg 


aggatgcgtc 






caggtactgc 


agctccccgt 


gcggcggacg 


cggctcggcg 


tcccgctcct 


gtgcggcggg 


2580 


gggcaagggc 


cggcgcggca 


gctccgagcc 


ggccacaggc 


atggcgcggc 


gggcggggga 


2640 


cggaggcagg 


cgaagtggcg 


cggcgggacg 


gaggcaggcc 


aagtggcgcg 


gcgggacgga 


2700 


ggcaggccaa 


gtggcgcggc 


ggcaggaccc 


tcgnccgatt 


aaccgagcct 


cgtactagcc 


2760 



wo 2004/050860 



PCT/US2003/040063 



75 



2761 



<210> 57 

<211> 3485 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (345i) . , (3458) 

<223> n=a, c, g or t 



60 
120 
180 
240 
300 



480 

540 
600 



<400> 57 

gccatggtaa gcgcggacgc catggtaagc gcggacgcca tggtatggta agcgcggacg 
ccatggtaag cgcggacgcc atggtaagcg cggacgccat ggtaagcgcg gacgccatgg 
taagcgcgga cgccatggta agcgcggacg ccatggtaag cgcggacgcc atggtaagcg 
cggacgccat ggtaagcgcg gacgccatgc acacggaccc tgactactcg gctgcctatg 
tcgtcataga aactgatgca gaagatggaa tcaaggggtg tggaattacc ttcactctgg 
gaaaaggcac tgaagttgtt gtctgtgctg tgaatgccct cgcccaccat gtgctcaaca 360 
aggacctcaa ggacattgtt ggtgacttca gaggcttcta taggcagctc acaagtgatg 420 
ggcagctcag atggattggt ccagaaaagg gcgtggtgca cctggcgaca gcggccgtcc 
taaacgcggt gtgggacttg tgggccaagc aggagggaaa gcctgtctgg aagttacttg 
tggacatgga tcccaggatg ctggtatcct gcatagattt caggtacatc actgatgtcc 
tgactgagga ggatgcccta gaaatactgc agaaaggtca aattggtaaa aaagaaagag 660 
gtgggttgta agaaaatttt cttcattgtt tttgctaaca ttgtccactt ttgagtgccc 720 
ctgtcctttt ggggtacaca ttgtcttccc aaatgccctg tgctgagcag ctaggccctc 780 
aaatcaacat tcaagtctgc atggtgaagc ctgctgggta tgacctctga ctgcagagtt 840 
tgcttcagcc actgctgaaa ggaagtttgg ctttaggatt acactgtagg gagagccctg 900 
ggggagcagg gcagtccgtg agagtatcct gatcacctgg gtttgacatc ctagtaattt 
gtggctgggt gtgtgtgtgc agggccggat caggagaaca gctggactct ccaggggaaa 
cagcttagct acaggcactt ccaattccga agggccctgg aaagtgcaaa atgttgacgg 1080 
cgctgtgttt tcacagagaa gcaaatgctg gcacaaggat accctgctta cacgacatcg 1140 
tgcgcctggc tggggtactc agatgacacg ttgaagcagc tctgtgccca ggcgctgaag 1200 
gatggctgga ccaggtgagt gtgatgatgg acctgacttt cccagttggc ggcaggagag 1260 
actcaggcag tttaaagtaa aggtgggtgc tgatctccag gatgacatgc gaagatgcca 1320 
aatcatccga gacatgattg gaccggaaaa gactttgatg atggatgcca accagcgctg 1380 



960 

1020 
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aa^ t* at* acct 




agtggatgtc 


caagccggcc 


aagttcaagc 


cattgtggat 


1440 


tcraaaaacca 

^3 J v» WW 


acctcccctg 


atgacattct 


ggggcacgcc 


accatttcca 


aggcactggt 


1500 




ou i«yy^aL> wy 


^ wov»oiy y wy w 


acagtgccac aatagagtga 


tatttaagca 


1560 








ccagattgac 


agttgcagac 


tgqgcaqtgt 


1620 






t" of^ f* fra t" OQC 
uy^^uyouyyu. 


caaaaagttt 


gaaattcctg 


tttgccccca 


1680 






o t" aa act aa t 


gcagcacctg 


attatatttg 


actacatatc 


1740 






auoyyy ^y *»y 


tgagtatgtt 


gaccacctgc 


atgagcattt 


1800 






agcgggctCc 


ctacatgcct 


cccaaggatc 


ccggctactc 


1860 


aoCoyaaa ty 


a a Bf cf a oa a a Y 


ctotaaaaaa 


acaccagtat 


ccagatggtg aagtttggaa 


1920 




r* ^ oc t" f*a a Cf 
^^uy w uwcioy 


aaaat taaot 


gctcagcccc 


aacaactttt 


ttctttctga 


1980 






ctn toaaaata 


gttttacaaa 


aatggattta 


aaaaatccta 


2040 


ccgatcaaga 


tgagttcagc 


tagaagt cat 


accaccctca 


ggaatcagct 


aaagcaaaaa 


2100 


gaacttttac 


ctcggcatcc 


agcccaaccc 


ctaaagactg 


acaatatcct 


tcgagctcct 


2160 


ttgaaagcac 


cctaaacagc 


cat tcccatt 


ttaatagttg 


gatgcggatt 


gtacccttca 


2220 


atctgaaagt 


cttcagcttt 


gaagtcatca 


attttctcaa 


cttttcgaag 


aatcctgagc 


2280 


tttgggaaag gtctgggttc 


tcactaaaac 


taaaaacaaa 


ataaggccat 


tattttgcca 


2340 


taatcgtacg 


acctgttgta 


attgctcctc 


atgtccgtga 


aacaagtaca 


caggatgtga 


2400 


tcaacaaagt 


tccattttac 


aooaatataa 


tcctgtcgat 


accttgccgt 


aggttatgta 


2460 


acatgattgg 


agcgcaacca 


gctgttctcc 


tgcacagatc 


gagagtgagg 


ggtattttgt 


2520 


gacactacac 


agcatcagga 


acctaatacc 
y'^'^^yy ^'y^^ 


tcatcaggta 


actgaagagc 


tcagtgtggt 


2580 


aggctggcaa 


gtcacccttc 


ccgagacagc 


ccaccttcag 


gcccgtgatg 


tgcgcaatca 


2640 


tgtacgtgag 


cagggcgtag 


w wy y ^y u uy w 


tgaaaggcac 


accgaggccc 


atgtctcccg 


2700 


atctctggta 


cagctggcag 


crapacrf t" f"ac 
y ci VffOy w L> %»o 


tgttcaccac 


atagaactgg 


cagagggcat 


2760 


ggcatggagg 


cagcgccatc 


aaacicraaaa 

oy oiy y oidyci w 


ctcttggatt 


ccaagcgcac 


atgatgattc 


2820 


ttctgtcgtc 


agggttggtt 




caatcactct 


ttgcagttgg 


tcaactccct 


2880 


gtcctgaata 


atctgattcc 


atatctctgt 


attctgcccc 


aaaatgcctc 


cactggaagc 


2940 


cataaactgg gcccaagtcc 


ccttcttctc 


tggtggagaa 


tcccaggctg 


tccaaaaagt 


3000 


ctcgggatcc 


attggcatcc 


cagattttca 


ctcccttgga 


agacagctct 


ttagcatttg 


3060 


tggatccctt 


gataaaccac 


agcaactcct 


ccaaaacacc cttccagaac acacgtttgg 


3120 


ttgtcagcag 


agggaattca 


tctctcaggc 


tgtagcgcgc 


ctgcatgccg 


aataccgaca 


3180 
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gggtgccggt 


gcccgtgcgg 


tcgtccttcc 


tgacgccgca 


gcggaggatg 


tgttggatct 


3240 


gccccaggta 


ctgcagctcc 


ccgtgcggcg 


gacgcggctc 


ggcgtcccgc 


tcctgtgcgg 


3300 


cggggggcaa 


gggccggcgc 


ggcagctccg 


agccggccac 


aggcatggcg 


cggcgggcgg 


3360 


gggacggagg 


caggcgaagt 


ggcgcggcgg 


gacggaggca 


ggccaagtgg 


cgcggcggga 


3420 


cggaggcagg 


ccaagtggcg 


cggtggcagg 


accctcgncc 


gattaaccga 


gcctcgtact 


3480 


agcca 












3485 


<210> 58 
<211> 4112 
<212> DNA 
<213> Homo sapien 












<220> 

<221> misc_feature 
<222> (4085) . . (4085) 
<223> n=a, c, g or t 












<400> 58 
gccacggtaa 


gcgcggacgc 


catggtaagc 


gcggacgcca 


tggtatggta 


agcgcggacg 


60 


ccatggtaag 


cocqqacgcc 


atggtaagcg 


cggacgccat 


ggtaagcgcg 


gacgccatgg 


120 


taagcgcgga 


cgccacggta 


agcgcggacg 


ccatggtaag 


cgcggacgcc 


atggtaagcg 


180 


cggacgccat 


ggtaagcgcg 


gacgccatgc 


acacggaccc 


tgactactcg 


gctgcctatg 


240 


tcgtcataga 


aactgatgca 


gaagatggaa 


tcaaggggtg 


tggaattacc 


ttcactctgg 


300 


gaaaaggcac 


tgaagttgtt 


gtctgtgctg 


tgaatgccct 


cgcccaccat 


gtgctcaaca 


360 


aggacctcaa 


ggacattgtt 


ggtgacttca 


gaggcttcta 


taggcagctc 


acaagtgatg 


420 


ggcagctcag 


atggattggt 


ccagaaaagg 


gcgtggtgca 


cctggcgaca 


gcggccgtcc 


480 


taaacgcggt 


gtgggacttg 


tgggccaagc 


aggagggaaa 


gcctgtctgg 


aagttacttg 


540 


tggacatgga 


tcccaggatg 


ctggtatcct 


gcatagattt 


caggtacatc 


actgatgtcc 


600 


tgactgagga 


ggatgcccta 


gaaatactgc 


agaaaggtca 


aattggtaaa 


aaagaaagag 


660 


agaagcaaat 


gctggcacaa 


ggataccctg 


cttacacgac 


atcgtgcgcc 


tggctggggt 


720 


actcagatga 


cacgttgaag 


cagctctgtg 


cccaggcgct 


gaaggatggc 


tggaccaggt 


780 


ttaaagtaaa 


ggtgggtgct 


gatctccagg 


atgacatgcg 


aagatgccaa 


atcatccgag 


840 


acatgattgg 


accggaaaag 


actttgatga 


tggatgccaa 


ccagcgctgg 


gatgtgcctg 


900 


aggcggtgga 


gtggatgtcc 


aagctggcca 


agttcaagcc 


attgtggatt 


gaggagccaa 


960 


cctcccctga 


tgacattctg 


gggcacgcca 


ccatttccaa 


ggcactggtc 


ccattaggaa 


1020 


ttggcattgc 


cacaggagaa 


caggtgagtg 


acgcccccaa 


caggtggatg 


acgtcccctt 


1080 
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ggggtcagta 


cacgccgacc 


agtgaccgag 


gacacagttg 


tgtgttaggc 


tccatcacct 


1140 


gctgtacttt 


gagttgggaa 


attttcatca 


tcttagaaac 


tgggtcatct 


tatcagagtc 


1200 


tagagtcaga 


tatagaaaaa 


gtttgtggct 


atttctccaa 


tctatatgac 


taaggtcagg 


1260 


tatctttttc 


aaagtgtcta 


actgaaatcg 


aaaaggcagc 


aatttaaagt 


tgctactgca 


1320 


agggcagaaa 


atggtctcaa 


gaaagccagc 


tttcaaaccg 


aataaacacg 


actgcgttca 


1380 


ctttttgagc 


ttataaatga 


agcccgagtg 


cctgccaaaa 


cctgctgcag 


tcagcccacg 


1440 


agcagagcag 


cgtgaggagc 


tgattctcag 


ttttcccggc 


aaaaggagca 


atactgctct 


1500 


gccgtggttc 


cgtgttgtca 


tctgtgccac 


ctgctcatca 


ctgtcaccgt 


atttcatcct 


1560 


gacgcttcat 


ctcccactta 


tcagtcgctg 


tgacagtcat 


tccctcataa 


atggcgagcc 


1620 


agtgtgattt 


tgacctgact 


cacactgttg 


cattagcaga 


tttgtaaaga 


agtgagcaca 


1680 


aggtccctgc 


ccacgctata 


aaagctcgcc 


tcatgcccag 


cgagaacaaa 


gaagaaatac 


1740 


agtctgggcc 


tcctgacggc 


cactgatgaa 


taattattgg 


catagagtgg 


ctgcgttgcc 


1800 


aggtttagag 


atcctgaagg 


ccaaggctga 


ctcttctgtt 


ggtgttattt 


tcaattctat 


1860 


ttccagtgcc 


acaatagagt 


gatatttaag 


caactcctac 


aggcgaaggc 


cctgcagttc 


1920 


ctccagattg 


acagttgcag 


actgggcagt 


gtcaatgaga 


acctctcagt 


attgctgacg 


1980 


gccaaaaagt 


ttgaaagtaa 


gcgtgctgca 


gcggctgcag 


accagacctt 


catttcccca 


2040 


ctaatcagac 


acctcccttg 


atggtttgca 


attcacatgc 


atgggagtct 


gtagtttgcc 


2100 


atttcgattt 


ttttctaact 


ctcatttagc 


tttaatccgg 


gaatttttga 


ttttcatctt 


2160 


ggaatttcct 


tcctaaatat 


taataaatga 


tttaatcacc 


tgtgggcaat 


aagaaaaacc 


2220 


agaaagttcc 


ctctcacccc 


ttcctctccc 


tgccctactc 


ttggtattaa 


atagaaacga 


2280 


tttcctttta 


gttcctgttt 


gcccccatgc 


tggtggagtt 


ggcctctgtg 


aactggtgca 


2340 


gcacctgatt 


atatttgact 


acatatcagt 


ttctgcaagc 


cttgaaaata 


gggtgtgcga 


2400 


gtatgttgac 


cacctgcatg 


agcatttcaa 


gtatcccgtg 


atgatccagc 


gggcttccta 


2460 


catgcctccc 


aaggatcccg 


gctactcaac 


agaaatgaag 


gaggaatctg 


taaagaaaca 


2520 


ccagtatcca 


gatggtgaag 


tttggaagaa 


actccttcct 


gctcaagaaa 


attaagtgct 


2580 


cagccccaac 


aacttttttc 


tttctgaagt 


gaaagggctt 


aaaatttctt 


ggaaatagtt 


2640 


ttacaaaaat 


ggatttaaaa 


aatcctaccg 


atcaagatga 


gttcagctag 


aagtcatacc 


2700 


accctcagga 


atcagctaaa 


gcaaaaagaa 


cttttacctc 


ggcatccagc 


ccaaccccta 


2760 


aagactgaca 


atatccttcg 


agctcctttg 


aaagcaccct 


aaacagccat 


ttccatttta 


2820 


atagttggat 


gcggattgta 


cccttcaatc 


tgaaagtctt 


cagctttgaa 


gtcatcaatt 


2880 


ctctcaactt 


ttcgaagaat 


cctgagcttt 


gggaaaggtc 


t999ttct:cg 


ctgaagctaa 


2940 
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aaacaaaata 


aggccattat 


tttgccataa 


ttgcacgacc 


tgttgtaatt 


gctcctcatg 


3000 


tccgtgaaac 


aagcacacag 


gatgtgatca 


acaaagttct 


attttacagg 


agtatgatcc 


3060 


tgtcgatacc 


ttgccgtagg 


ttatgtaaca 


tgattggagc gcaaccagct 


gttctcttgc 


3120 


acagatcgag 


agcgaggggt 


attttgtgac 


attacacagc 


atcaggagcc 


tggtgcctca 


3180 


tcaggtaact 


gaagagctca 


gtgtggtagg 


ctggcaagtc 


acccttcccg 


agacagccca 


3240 


ccttcaggcc 


cgtgatgtgc 


gcaatcatgt 


acgtgagcag 


ggcgtagctg 


gcgatgttga 


3300 


aaggcacacc 


gaggcccatg 


tctcccgatc 


tctggtacag 


ctggcaggac 


agctcactgt 


3360 


tcaccacata 


gaactggcag 


agggcatggc 


atggaggcag 


cgccatcaga 


ggaagatctc 


3420 


ttggattcca 


agcgcacatg 


atgattcttc 


tgtcgtcagg gttggtcttg 


atggtgtcaa 


3480 


tcactctttg 


cagttggtca 


actccctgcc 


ctgaataatc 


tgattccata 


tctctgtatt 


3540 


ctgccccaaa 


atgcctccac 


tggaagccat 


aaactgggcc 


caagtcccct 


tcttctctgg 


3600 


tggagaatcc 


caggctgtcc 


aaaaagtctc 


gggatccatt 


ggcatcccag 


attttcactc 


3660 


ccttggaaga 


cagctcttta 


gcatttgtgg 


atcccttgat 


aaaccacagc 


aactcctcca 


3720 


aaacaccctt 


ccagaacaca 


cgtttggccg 


tcagcagagg 


gaattcatct 


ctcaggctgt 


3780 


agcgcgcctg 


catgccgaat 


accgacaggg 


tgccggtgcc 


cgtgcggtcg 


tccttcctga 


3840 


cgccQcaqcg 


qaqgatgtQt 


tggatctgcc 


ccaggtactg 


cagctccccg 


tgcggcggac 


3900 


gcggctcggc 


gtcccgctcc 


tgtgcggcgg 


ggggcaaggg 


ccggcgcggc 


agctccgagc 


3960 


cggccacagg 


catggcgcgg 


cgggcggggg 


acggaggcag gcgaagtggc gcggcgggac 


4020 


ggaggcaggc 


caagtggcgc 


ggcgggacgg 


aggcaggcca 


agtggcgcgg 


tggcaggacc 


4080 


ctcgnccgat 


taaccgagcc 


tcgtactagc 


ca 






4112 


<210> 59 
<2ia> 2636 
<212> DNA 
<213> Homo sapien 












<400> 59 
gccacggtaa 


gcgcggacgc 


catggtaagc 


gcggacgcca 


tggtatggta 


agcgcggacg 


" 60 


ccatggtaag 


cgcggacgcc 


atggtaagcg 


cggacgccat 


ggtaagcgcg 


gacgccatgg 


120 


taagcgcgga 


cgccatggta 


agcgcggacg 


ccatggtaag 


cgcggacgcc 


atggtaagcg 


180 


cggacgccat 


ggtaagcgcg 


gacgccatgc 


acacggaccc 


tgactactcg 


gctgcctatg 


240 


tcgtcataga 


aactgatgca 


gaagacggaa 


tcaaggggtg 


tggaattacc 


ttcactctgg 


300 


gaaaaggcac 


tgaagttgtt 


gtctgtgctg 


tgaatgccct 


cgcccaccat gtgctcaaca 


360 


aggacctcaa 


ggacattgtt 


ggtgacttca 


gaggcttcta 


taggcagctc acaagtgatg 


420 
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ggcagctcag 


atggattggt 


ccagaaaagg 


taaacgcggt 


gtgggacttg 


tgggccaagc 


tggacatgga 


tcccaggatg 


ctggtatcct 


tgactgagga 


ggatgcccta 


gaaatactgc 


agaagcaaat 


gctggcacaa 


ggataccctg 


actcagatga 


cacgttgaag 


cagctctgtg 


ttaaagtaaa 


ggtgggtgct 


gatctccagg 


acatgattgg 


accggaaaag 


actttgatga 


aggcggtgga 


gtggatgtcc 


aagctggcca 


cctcccctga 


tgacatcctg 


gggcacgcca 


ttggcattgc 


cacaggagaa 


cagtgccaca 


cgaaggccct 


gcagttcctc 


cagattgaca 


tctcagtatt 


gctgatggcc 


aaaaagtttg 


ttggcctctg 


tgaactggtg 


cagcacctga 


gccttgaaaa 


tagggtgtgt 


gagtatgttg 


tgatgatcca 


gcgggcttcc 


tacatgcctc 


aggaggaatc 


tgtaaagaaa 


caccagtatc 


ctgctcaaga 


aaattaagtg 


ctcagcccca 


ttaaaatttc 


ttggaaatag 


ttttacaaaa 


gagttcagct 


agaagtcaca 


ccaccctcag 


tcggcatcca 


gcccaacccc 


taaagactga 


ctaaacagcc 


atttccattt 


taatagttgg 


ttcagctttg 


aagtcatcaa 


ttttctcaac 


tctgggttct 


cgctgaagct 


aaaaacaaaa 


cctgttgtaa 


ttgctcctca 


tgtccgtgaa 


ctattttaca 


ggagtatgat 


cctgtcgata 


gcgcaaccag 


ctgttctctt 


gcacagatcg 


gcatcaggag 


cctggtgcct 


catcaggtgt 


ttaaagacag 


gaacacagtc 


aatctgtaac 


acaatcccta 


acccatctgc 


ccctggcaga 
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gcgtggtgca 


cctggcgaca 


gcggccgtcc 


480 


aggagggaaa 


gcccgtctgg 


aagttacttg 


540 


gcatagattt 


caggtacatc 


actgatgtcc 


600 


agaaaggtca 


aattggtaaa 


aaagaaagag 


660 


cttacacgac 


atcgtgcgcc 


tggctggggt 


720 


cccaggcgct 


gaaggatggc 


tggaccaggt 


780 


atgacatgcg 


aagatgccaa 


atcatccgag 


840 


tggatgccaa 


ccagcgctgg 


gatgtgcctg 


900 


agttcaagcc 


attgtggatt 


gaggagccaa 


960 


ccatttccaa 


ggcactggtc 


ccattaggaa 


1020 


atagagtgat 


atttaagcaa 


ctcctacagg 


1080 


gttgcagact 


gggcagtgtc 


aatgagaacc 


1140 


aaattcctgt 


ttgcccccat 


gctggtggag 


1200 


ttatatttga 


ctacatatca 


gtttctgcaa 


1260 


accacctgca 


tgagcatttc 


aagtatcccg 


1320 


ccaaggatcc 


cggctactca 


acagaaatga 


1380 


cagatggtga 


agtttggaag 


aaactcctcc 


1440 


acaactcttt 


tctttctgaa 


gtgaaagggc 


1500 


atggatttaa 


aaaatcctac 


cgatcaagat 


1560 


gaatcagcta 


aagcaaaaag 


aacttttacc 


1620 


caatatcctt 


cgagctcctt 


tgaaagcacc 


1680 


atgcggattg 


tacccttcaa 


tctgaaagtc 


1740 


ttttcgaaga 


atcctgagct 


tcgggaaagg 


1800 


taaggccatt 


actttgccat 


aattgtacga 


1860 


acaagtacac 


aggatgtgat 


caacaaagtt 


1920 


ccttgccgta 


ggttatgtaa 


catgattgga 


1980 


agagcgaggg 


gtattttgtg 


acattacaca 


2040 


aagttcttac 


aaccactctt 


ggcaaattta 


2100 


tcatagtagc 


tctacgttca 


cttgaattcc 


2160 


aagaaggaaa 


gatgacacgc 


atggacagtg 


2220 
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aacagaaagg 


gatgaaagcc 


aggattcctg 


81 

ggatgaacag 


acagtggcaa 


ttaggatgtg 


2280 


aagacaggtc 


acaacctatt 


actatgtcta 


aaaacgacca 


gagcagagag 


ccagagagaa 


2340 


taagcctgaa 


gtcacctcca 


ctcaaaagca 


gccaaactcc 


ctcaaaggag 


taacttttaa 


2400 


aacccggacc 


^ a 9 o ^ nnik A 

caacct ggaa 


yyy y ^ uacieio 


dy uy ui- uyy u 


tctaaatttt 

^ w k y Cfc^ hi w w w 


tttccttaaa 

www W www ^ 


2460 


gctcatgaag 


cagatgaact 


tacattttta 


ttgccatttc 


atatcaattg 


ttggctgcta 


2520 


taacttaggg 


atctcaacag 


acttttgaag 


tttggaccta 


aatattgtac 


ttaatgtaaa 


2580 


ttaacaaaaa 


atatttatgg 


ccagggtggt 


ggcttatgcc 


tgtaattcca 


gaattt 


2636 


<210> 60 
<211> 2764 
<212> DNA 
<213> Homo sapien 












<220> 

<221> misc_feature 
<222> (2737) . . (2737) 
<223> n=a, c, g or t 












<400> 60 
gccatggtaa 


gcgcggacgc 


catggtaagc 


gcggacgcca 


tggtatggta 


agcgcggacg 


60 


ccatggtaag 


cgcggacgcc 


atggtaagcg 


cggacgccat 


ggtaagcgcg 


gacgccatgg 


120 


taagcgcgga 


cgccatggta 


agcgcggacg 


ccatggtaag 


cgcggacgcc 


atggtaagcg 


180 


cggacgccat 


ggtaagcgcg 


gacgccatgc 


acaicggaccc 


tgactactcg 


gctgcctatg 


240 


tcgtcataga 


aactgatgca 


gaagacggaa 


tcaaggggtg 


tggaattacc 


ttcactctgg 


300 


gaaaaggcac 


tgaagttgtt 


gtctgtgctg 


tgaatgccct 


cgcccaccat 


gtgctcaaca 


360 


aggacctcaa 


ggacattgtt 


ggtgacttca 


gaggcttcta 


taggcagctc 


acaagtgatg 


420 


ggcagctcag 


atggattggt 


ccagaaaagg 


gcgtggtgca 


cctggcgaca 


gcggccgtcc 


480 


taaacgcggt 


gtgggacttg 


tgggccaagc 


aggagggaaa 


gcctgtctgg 


aagttacttg 


540 


tggacatgga 


tcccaggatg 


ctggtatcct 


gcatagattt 


caggtacatc 


actgatgtcc 


600 


tgactgagga ggatgcccta 


gaaatactgc 


agaaaggtca 


aattggtaaa 


aaagaaagag 


660 


agaagcaaat 


gctggcacaa 


ggataccctg 


cttacacgac 


atcgtgcgcc 


tggctggggt 


720 


actcagatga 


cacgttgaag 


cagctctgtg 


cccaggcgct 


gaaggatggc 


tggaccaggt 


780 


ttaaagtaaa 


ggtgggtgct 


gatctccagg 


atgacatgcg 


aagatgccaa 


atcatccgag 


840 


acatgattgg 


accggaaaag 


actttgatga 


tggatgccaa 


ccagcgctgg 


gatgtgcctg 


900 


aggcggtgga gtggatgtcc 


aagctggcca 


agttcaagcc 


attgtggatt 


gaggagccaa 


960 


cctcccctga 


cgacattctg 


gggcacgcca 


ccatttccaa 


ggcactggtc 


ccattfaggaa 


1020 
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ttggcattgc 


cacaggagaa 


cagggtgtgt 


gagtatgttg 


accacctgca 


tgagcatttc 


1080 


aagtatcccg 


tgatgatcca 


gcgggcttcc 


tacatgcctc 


ccaaggatcc 


cggctactca 


1140 


acagaaatga 


aggaggaatc 


tgtaaagaaa 


caccagtacc 


cagatggtga 


agtttggaag 


1200 


aaactccttc 


ctgctcaaga 


aaattaagtg 


ctcagcccca 


acaactttct 


tctttctgaa 


1260 


gtgaaagggc 


ttaaaatttc 


ttggaaatag 


ttttacaaaa 


atggatctaa 


aaaatcctac 


1320 


cgatcaagat 


gagttcagct 


agaagtcata 


ccaccctcag 


gaatcagcta 


aagcaaaaag 


1380 


aacttttacc 


tcggcatcca 


gcccaacccc 


taaagactga 


caatatcctt 


cgagctcctt 


1440 


tgaaagcacc 


ctaaacagcc 


atttccattt 


taatagttgg 


atgcggattg 


tacccttcaa 


1500 


tctgaaagtc 


ttcagctttg 


aagtcatcaa 


ttttctcaac 


ttttcgaaga 


atcctgagct 


1560 


ttgggaaagg 


tctgggttct 


cgctgaagct 


aaaaacaaaa 


taaggccatt 


attttgccat 


1620 


aattgtacga 


cctgttgtaa 


ttgctcctca 


tgtccgtgaa 


acaagtacac 


aggatgtgat 


1680 


caacaaagtt 


ctattttaca 


ggagtatgat 


cctgtcgata 


ccttgccgta 


ggttatgtaa 


1740 


catgattgga 


gcgcaaccag 


ctgttctcct 


gcacagatcg 


agagtgaggg 


gtattttgtg 


1800 


acattacaca 


gcatcaggag 


cctggtgcct 


catcaggtaa 


ctgaagagct 


cagtgtggta 


1860 


ggccggcaag 


tcacccttcc 


cgagacagcc 


caccttcagg 


cccgtgatgc 


gcgcaatcat 


1920 


gtacgtgagc 


agggcgtagc 


tggcgatgtt 


gaaaggcaca 


ccgaggccca 


tgtctcccga 


1980 


tctctiggtac 


agctggcagg 


acagctcact 


gttcaccaca 


tagaactggc 


agagggcatg 


2040 


gcatggaggc 


agcgccatca 


gaggaagatc 


tcttggattc 


caagcgcaca 


tgatgattct 


2100 


tctgtcgtca 


gggttggttt 


tgatggtgtc 


aatcactctt 


tgcagttggt 


caactccctg 


2160 


tcctgaataa 


tctgattcca 


tatctctgta 


ttctgcccca 


aaatgcctcc 


actggaagcc 


2220 


ataaactggg 


cccaagtccc 


cttcttctct 


ggtggagaat 


cccaggctgt 


ccaaaaagtc 


2280 


tcgggatcca 


ttggcatccc 


agatttccac 


tcccttggaa 


gacagctctt 


tagcatttgt 


2340 


ggatcccttg 


ataaaccaca 


gcaactcctc 


caaaacaccc 


ttccagaaca 


cacgtttggt 


2400 


tgccagcaga 


gggaattcat 


ctctcaggct 


gtagcgcgcc 


tgcatgccga 


ataccgacag 


2460 


ggtgccggtg 


cccgtgcggt 


cgtccttcct 


gacgccgcag 


cggaggatgt 


gttggatctg 


2520 


ccccaggtac 


tgcagctccc 


cgtgcggcgg 


acgcggctcg 


gcgtcccgct 


cctgtgcggc 


2580 


ggggggcaag 


ggccggcgcg 


gcagctccga 


gccggccaca 


ggcatggcgc 


ggcgggcggg 


2640 


ggacggaggc 


aggcgaagtg 


gcgcggcggg 


acggaggcag 


gccaagtggc 


gcggcgggac 


2700 


ggaggcaggc 


caagtggcgc 


ggtggcagga 


ccctcgnccg 


attaaccgag 


cctcgtacta 


2760 



2764 
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<210> 61 

<211> 3436 

<212> DNA 

<213> Homo sapien 




<400> 61 
gccatggtaa 


gcgcggacgc 


catggtaagc 


ccatggtaag 


cgcggacgcc 


atggtaagcg 


taagcgcgga 


cgccatggta 


agcgcggacg 


cggacgccat 


ggtaagcgcg 


gacgccatgc 


tcgtcataga 


aactgatgca 


gaagatggaa 


gaaaaggcac 


tgaagttgtt 


gtctgtgctg 


aggacctcaa 


ggacattgtt 


ggtgacttca 


ggcagctcag 


atggattggt 


ccagaaaagg 


taaacgcggt 


gtgggacttg 


tgggccaagc 


tggacatgga 


tcccaggatg 


ctggtatcct 


tgaccgagga 


ggatgcccta 


gaaatactgc 


agaagcaaat 


gctggcacaa 


ggataccctg 


actcagatga 


cacgttgaag 


cagctctgtg 


ttaaagtaaa 


ggtgggtgct 


gatctccagg 


acatgattgg 


accggaaaag 


actttgatga 


aggcggtgga 


gtggatgtcc 


aagctggcca 


cctcccctga 


tgacattctg 


gggcacgcca 


ccggcattgc 


cacaggagaa 


cagtgccaca 


cgaaggccct 


gcagttcctc 


cagattgaca 


tcccagtatt 


gctgatggcc 


aaaaagtttg 


ttggcctctg 


tgaactggtg 


cagcacctga 


gccttgaaaa 


tagggtgtgt 


gagtatgttg 


tgatgatcca 


gcgggcttcc 


tacatgcctc 


aggaggaatc 


tgtaaagaaa 


caccagtatc 


ctgctcaaga 


aaattaagtg 


ctcagcccca 


ttaaaatttc 


ttggaaatag 


ttttacaaaa 


gagttcagct 


agaagtcata 


ccaccctcag 


tcggcatcca 


gcccaacccc 


taaagactga 
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gcggacgcca cggtatggta agcgcggacg 60 

cggacgccat ggtaagcgcg gacgccatgg 120 

ccatggtaag cgcggacgcc atggtaagcg 180 

acacggaccc tgactactcg gctgcctatg 240 

tcaaggggtg tggaattacc ttcactctgg 300 

tgaatgcccc cgcccaccat gtgctcaaca 360 

gaggcttcta taggcagctc acaagtgatg 420 

gcgtggtgca cctggcgaca gcggccgtcc 480 

aggagggaaa gcctgtctgg aagttacttg 540 

gcatagattt caggtacatc actgatgtcc 600 

agaaaggtca aattggtaaa aaagaaagag 660 

cttacacgac atcgtgcgcc tggctggggt 720 

cccaggcgct gaaggatggc tggaccaggt 780 

atgacatgcg aagatgccaa atcatccgag 840 

tggatgccaa ccagcgctgg gatgtgcctg 900 

agttcaagcc attgtggatt gaggagccaa 960 

ccatttccaa ggcactggtc ccattaggaa 1020 

atagagtgat atttaagcaa ctcctacagg 1080 

gttgcagact gggcagtgtc aatgagaacc 1140 

aaattcctgt ttgcccccat gctggtggag 1200 

ttatatttga ctacatatca gtttctgcaa 1260 

accacctgca tgagcatttc aagtatcccg 1320 

ccaaggatcc cggctactca acagaaatga 1380 

cagatggtga agtttggaag aaactccttc 144 0 

acaacttttt tctttctgaa gcgaaagggc 1500 

atggatttaa aaaatcctac cgatcaagat 1560 

gaatcagcta aagcaaaaag aacttttacc 1620 

caatatcctt cgagctcctt tgaaagcacc 1680 
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ctaaacagcc 


atttccattt 


taacagttgg 


atgcggattg 


tacccttcaa 


tctgaaagtc 


1740 


ttcagctttg 


aagtcatcaa 


ttttctcaac 


ccttcgaaga 


atcctgagct 


ttgggaaagg 


1800 


tctgggttct 


cgctgaagct 


aaaaacaaaa 


taaggccatt 


attttgccat 


aattgtacga 


1860 


cctgttgtaa 


ttgctcctca 


tgtccgtgaa 


acaagtacac 


aggatgtgat 


caacaaagtt 


1920 


ctattttaca 


ggagtatgat 


cctgtcgata 


ccttgccgta 


ggctatgtaa 


cacgattgga 


1980 


gcgcaaccag 


ctgttctctt 


gcacagatcg 


agagtgaggg 


gtattttgtg 


acattacaca 


2040 


gcatcaggag 


cctggtgcct 


catcaggtaa 


ctgaagagct 


cagtgtggta 


99ctggcaag 


2100 


tcacccttcc 


cgagacagcc 


caccttcagg 


cccgtgatgt 


gcgcaatcat 


gtacgtgagc 


2160 


agggcgtagc 


tggcgatgtt 


gaaaggcaca 


ccgaggccca 


tgtctcccga 


tctctggtac 


2220 


agctggcagg 


acagctcact 


gttcaccaca 


tagaactggc 


agagggcatg 


gcatggaggc 


2280 


agcgccatca 


gaggaagata 


ggaaccagca 


gaggaagaca 


aggagggatg 


gtggtttgaa 


2340 


agaccacagc 


taaaggcaaa 


gtaaaacagg 


agagaaacag 


aagccaactc 


atatggtgga 


2400 


gaccaggaga 


gagagccact 


gggctgcagt 


gatgtccata 


acagcctctg 


cagcgatggc 


2460 


acggagccga 


gggagactat 


ccatcggtgc 


aaggtttctg 


caggtgtcca 


tttacggctg 


2520 


aagcaatgct 


cttccatcag 


agctgaaggg 


atctgggcta 


cctcgtggca 


ccagattaca 


2580 


aatacagcag 


gaataattct 


gtttgccaca 


ggaaactggt 


gcttctggta 


caccctccta 


2640 


tattaaaagt 


ctctattaca 


tggccaggca 


cagtggctca 


tgcctgaaat 


cccaacactg 


2700 


ggacgccagg 


gtgggcagat 


cacttgaggc 


caacttgagt 


tcgagactag 


cccggccaac 


2760 


atagtgaaac 


cccgtctctg 


ctaaaaatac 


aaaaattcgc 


gcgcctgtgg 


tcccagctct 


2820 


ggggctgagg 


aatgggaact 


cacttgaacc 


tgggagatgg 


aggttgccgt 


gagctggagt 


2880 


gcaatggcgt 


gaccatgact 


cactacagcc 


tcaaattcct 


gggttcaaat 


gatcctctcg 


2940 


ccttggcctc 


tggagttgtt 


ggaattacag 


gtgtgagcca 


ccacacctgg 


ctctacaccc 


3000 




gaattttaca 


atataactgt 


taaaaatgtt 


agtgaggtgg 


cagcgagctg 


3060 


gaacctcatc 


tgcttccatt 


taaaatgtta 


acatcttaca 


tgtagtttct 


aggcaattta 


3120 


aaaccctttt 


tggtgtaaga 


atgagtataa 


acacacaaat 


taagatcatc 


tggttgggag 


3180 


gctgaggcag 


gcagatcatc 


tgaggtcagg 


agctcaagac 


cagcctgacc 


aacatggtga 


3240 


aaccccgtct 


ctactaaaaa 


tacaaaatta 


gccgggcatg 


gtggcacatg 


cctgtaatcc 


3300 


cagctacttg 


ggaggctgag 


gcaggagaat 


cactcgaacc 


caggaggcag 


aggttacagt 


3360 


gagccaagat 


cacgccgttg 


cattccagcc 


tgggcaacag 


agcaaaactc 


tgtctcaaaa 


3420 


aaaaaaaaaa 


aaaaaa 










3436 
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<210> 62 
<211> 2928 
<212> DNA 
<213> Homo sapien 




<400> 62 
gccatggtaa 


gcgcggacgc 


catggtaagc 


ccatggtaag 


cgcggacgcc 


atggtaagcg 


taagcgcgga 


cgccatggta 


agcgcggacg 


cggacgccat 


ggtaagcgcg 


gacgccatgc 


tcgtcataga 


aaccgatgca 


gaagatggaa 


gaaaaggcac 


tgaagttgtt 


gtctgtgctg 


aggacctcaa 


ggacattgtt 


ggtgacttca 


ggcagctcag 


atggattggc 


ccagaaaagg 


taaacgcggt 


gtgggacttg 


tgggccaagc 


tggacatgga 


tcccaggatg 


ctggtatcct 


tgactgagga 


ggatgcccta 


gaaatactgc 


agaagcaaat 


gctggcacaa 


ggataccctg 


actcagatga 


cacgttgaag 


cagctctgtg 


ttaaagcaaa 


ggcgggtgct 


gatctccagg 


acatgactgg 


accggaaaag 


actttgatga 


aggcggtgga 


gtggacgtcc 


aagctggcca 


cctcccctga 


tgacactctg 


gggcacgcca 


ttggcattgc 


cacaggagaa 


cagcgccaca 


cgaaggccct 


gcagttcctc 


cagattgaca 


tctcagtatt 


gctgatggcc 


aaaaagtttg 


ttggcctctg 


tgaactggtg 


cagcacctga 


gccttgaaaa 


tagggtgtgt 


gagtatgttg 


tgatgatcca 


gcgggcttcc 


tacatgcctc 


aggaggaatc 


tgtaaagaaa 


caccagtatc 


ctgctcaaga 


aaattaagtg 


ctcagcccca 


ttaaaatttc 


ttggaaatag 


ttttacaaaa 


gagttcagct 


agaagtcata 


ccaccctcag 
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gcggacgcca 


tggtatggta 


agcgcggacg 


60 


cggacgccat 


ggtaagcgcg 


gacgccatgg 


120 


ccatggtaag 


cgcggacgcc 


atggtaagcg 


180 


acacggaccc 


tgactactcg 


gctgcctatg 


240 


tcaaggggtg 


tggaattacc 


ttcactctgg 


300 


tgaatgccct 


cgcccaccat 


gtgctcaaca 


360 


gaggcttcta 


taggcagctc 


acaagtgatg 


420 


gcgtggtgca 


cctggcgaca 


gcggccgtcc 


480 


aggagggaaa 


gcctgtctgg 


aagttacttg 


540 


gcatagattt 


caggtacatc 


actgatgtcc 


600 


agaaaggtca 


aattggtaaa 


aaagaaagag 


660 


cttacacgac 


atcgtgcgcc 


tggctggggt 


720 


cccaggcgct 


gaaggatggc 


tggaccaggt 


780 


atgacatgcg 


aagatgccaa 


atcatccgag 


840 


tggatgccaa 


ccagcgctgg 


gatgtgcctg 


900 


agttcaagcc 


attgtggatt 


gaggagccaa 


960 


ccatttccaa 


ggcactggtc 


ccattaggaa 


1020 


atagagtgat 


atttaagcaa 


ctcctacagg 


1080 


gttgcagact 


gggcagtgtc 


aatgagaacc 


1140 


aaattcctgt 


ttgcccccat 


gctggtggag 


1200 


ttatatttga 


ctacatatca 


gtttctgcaa 


1260 


accacctgca 


tgagcatttc 


aagtatcccg 


1320 


ccaaggatcc 


cggctactca 


acagaaatga 


1380 


cagatggtga 


agtttggaag 


aaactccttc 


1440 


acaacttttt 


tctttctgaa 


gtgaaagggc 


1500 


atggatttaa 


aaaatcctac 


cgatcaagat 


1560 


gaatcagcta 


aagcaaaaag 


aacttttacc 


1620 
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tcggcatcca gcccaacccc taaagactga caatatcctt cgagctcctt tgaaagcacc 1680 

ctaaacagcc atttccatcc taatagttgg atgcggattg taccctccaa tctgaaagtc 1740 

ttcagctttg aagtcatcaa ttttctcaac ttttcgaaga atcctgagcc ttgggaaagg 1800 

tctgggttcc cgctgaagct aaaaacaaaa taaggccatt attttgccat aattgtacga 1860 

cctgttgtaa ttgctcctca tgtccgtgaa acaagtacac aggatgtgat caacaaagtt 1920 

ctattttaca ggagtatgat cctgtcgata ccttgccgta ggttatgtaa catgattgga 1980 

gcgcaaccag ctgttctctt gcacagatcg agagtgaggg gtattttgtg acattacaca 204 0 

gcatcaggag cctggtgcct catcaggtaa ctgaagagct cagtgtggta ggctggcaag 2100 

tcacccctcc cgagacagcc caccttcagg cccgtgatgt gcgcaatcat gtacgtgagc 2160 

agggcgtagc tggcgatgtt gaaaggcaca ccgaggccca tgtctcccga tctctggtac 2220 

agctggcagg acagctcact gttcaccaca tagaactggc agagggcacg gcatggaggc 2280 

agcgccatca gaggaagata ggaaccagca gaggaagata aggagggatg gtggtttgaa 234 0 

agaccacagc taaaggcaaa gtaaaacagg agagaaacag aagccaactc atatggtgga 2400 

gaccaggaga gagagccact gggctgcagt gatgtccata acagcctctg cagcgatggc 2460 

acggagctga gggagactat ccatcggtgc aaggtttctg caggtgtcca tttacggctg 2520 

aagcaatgct cttccatcag agctgaaggg atctgggcta cctcgtggca ccagattaca 2580 

aatacagcag gaataattct gtttgccaca ggaaactggt gcttctggta caccctccta 2640 

tattaaaagt ctctattaca tggccaggca cagtggctca tgcctgaaat cccaacactg 2700 

ggacgccagg gtgggcagat cacttgaggc caacttgagt tcgagactag cctggccaac 2760 

atagtgaaac cccgtctctg ctaaaaatac aaaaattcgc gcgcctgtgg tcccagctct 2820 

ggggctgagg aatgggaact cacttgaacc tgggagatgg aggttgccgt gagcggagat 2880 

cacaccactg cactccagcc tgggcaacag ggtgagactg tgtctcaa 2928 

<210> 63 
<211> 3100 
<212> DNA 
<213> Homo sapien 

<400> 63 

gccatggtaa gcgcggacgc catggtaagc gcggacgcca tggtatggta agcgcggacg 60 

ccatggtaag cgcggacgcc atggtaagcg cggacgccat ggtaagcgcg gacgccatgg 120 

taagcgcgga cgccatggta agcgcggacg ccatggtaag cgcggacgcc atggtaagcg 180 

cggacgccat ggtaagcgcg gacgccatgc acacggaccc tgactactcg gctgcctatg 240 

tcgtcataga aactgatgca gaagatggaa tcaaggggtg tggaattacc ttcactctgg 300 
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ugaagcLgc u 


gcccgcgccg 


L,gciaugcccu 


oooooa ooa 

cy ^wCauca t 


y uycLcaaca 


•> O V 


aCj^aCCuCoa 


ggacac ugcc 


gytgact tea 


yciggc 1 1 cua 


L. ay y Coy cue 


a oa an^ na fr* o 

acaay cyauy 


420 




acggaccggi. 


^^arta aa a 

ccagciaaogy 


ycy tyy uycrt 


cc i.yy cyaca 


ycyyccy ccc 


480 


^ A ^ O ^^^^^^ 

caaacgcggu 


gcgggacc ug 


Lyygccciciy w 


a nna nnnA a a 
oiyyoiy yy cicici 


ycc uy ucuyy 


aafi^^a^t* t*CT 
aayw uaei. ^3 


540 


tggacatgga 


^ ^ ^ ^ ^ ^ ^ 

CCCCayyaCy 


Ccgy LaCCC u 


<^/^a^* a/^a^ ^ f* 

y Ca C ag a C L C 


Cciyy LciCaLC 


acuyauytcc 


600 


tgactgagga 




gaaacac tgc 


a ^ a a t* ^ a 

agaaagguca 


a a ^ ^ a a a 

aaCCggCaaa 


aaaoaaa^^an 

ooay aaay ay 


66 0 


agoagCaaau 


gccggcacaa 


ggacaccccg 


^ 9 ^a ^^9 ^ 

cacacgac 


a oM ^ ^ on o o 

accgcgcgcc 


cggctggggc 


720 


acucagacgoi 


^ a ^ ^ ^Ya a 

cacgt ugoag 


cogcucugug 


cccaggcgcu 


na a rtrra ^ 

yaayyauyy c 


cyy acv-ayy c 


780 


1" ^ a a a ^ t* a 9 o 

u taaaguaao 


ggcgggcgcc 


ga ucuccagy 


a ^ na ^a ^ nf*n 
oi ty awci uy t«y 


aana^m^r*aa 
aayai>yt.caa 




840 


a ca cga c ^gg 


a ^^^na a a a^ 

accggciciaoig 


a^H^ ^«a ^ na 

acu L Lyciuyo 


^ nna ^ n a 
ugya uyc caa 


cuay cy c Lyy 


yauyuyww^y 


900 


aggcggcgga 


gtggotgtcc 


a an^^ rtn^ oa 
aoycuyy cca 


ay u wCaaycw 


a t* ^ o h nna ^ ^ 
au uy uyyau u 


y ay y ay eeaa 


960 


CCCCCCCuga 


cgowat.ccug 




tv»ai.i.ut.t»aa 


yycacuyyuw 


ev»at» cayyaa 


1020 


c cggcac cy c 


CaCaygayaa 


Cdy cy CCaCa 


a(,ciyayL.yai. 




c L.^ e L.a wayy 


1080 


cgaaggcccc 


CjCa^U CCCCC 


^ a a ^ a ^ a 

cagac cgaca 


^rt^a/^a/*»^ 

guL.ycciyaut 


y y y Cay uy u c 


aa uyayaav^^ 


114 0 


tcccagcat t 


gccga tggcc 


a a a a anf* 1* n 

aaaaagc c ug 


aaat cwCLgu 


ccycccccac 


CTr»t"an^CToaa 
gccggcggdg 


1200 


ccggcccccg 


cgaacuggcg 


cagcacccga 


^^at-a^^t- na 


o^aoa^a^oa 
ccacacocca 


y u u ccuy caa 


1260 


gccctigaaaa 


cagggtgi-gL. 


yagcatgccg 


a ^^a ^^^/^^a 

accaccugco 


cyay cac cue 


aay caccccy 


X J ^ u 


cgacgatcca 




^ a ^a ^ rt^/^ t" ^ 

tacougcc uC 


a ^rta ^ 

ccaagya t. 


f^nn o H a o ^ ^ a 
cyy cuacucei 


af*aaaaat'na 

Ok\^9'>;^^<3iOi uy a 


13 80 


aggaggaatic 




/^a ^^a a f* ^ 

caccagcacc 


^ a or a ♦* 1" or a 

Cay a uyy uya 


a rvt* t* ^ nna af^ 
ay C u uyyaay 


aaar*l"^r»f" f* 
aaaccccccc 


1440 


ccgcCcaaga 


aaac uaagug 


cucagcccca 


a oa a o^ ^ ^- H ^ 

acaact. uu t u 


t> o^ ^ ^ o^na a 

tcccccugaa 


nt* na a annn^ 
y cyaaayyyc 


X 3 U V 


uuaaclaCCuC 


^ ^ a a a ^ a ^ 

u L, y y a oa L ay 


^♦"♦"♦"a^aaaa 
u U L. UaCcicicla 


a ^ noa 1" ^ ^ a a 
a Lyya t. u caa 


aaaa^/^/^^'a/^ 
aaaaccccac 


/^rta ^ ^aana ^ 
cyaucaayac 


1 ^fiO 
J. SO V 


gage L cage c 


a a a ^ ^ ^ a ^ a 

agaagccacd 


CCaCCCCCog 


oaa ^ oa/^^^ a 

y o oi u c ay c L. a 


a a r^oa a a a a 

aay caaaaay 


aacuuLuacc 


162 0 


u C39^ ca c c cd 


gcccaacccc 


^aaarta^^ ^a 

uaaayaCty ci 


/*aa^a^/^/^^ ^ 


cyoycucccc 


^oaaan^a^^^^ 
cy aaay ea 


1660 


^ ^ A A ^ ^ a ^^^^^ 
ccaaacagcc 


a^^^^^at*t*^ 

auLLCcaccc 


uaataguugg 


atgcggat-tg 


f*a^oo^ ^ oaa 
Caccci* ccaa 


I* c uy aaay LtC 


1740 


cccagccccg 


aagtcatcaa 


Cucccccaac 


c c t c cgaaga 


acccugagcc 


^ ^ nnna a a na 
t cgy y aaayy 


X O U w 


tctgggttct 


cgctgciagct 


aaaaacaaaa 


taaggccaCt 


accccgccau 


a a ^ ^nh'a^na 
aaucy cacya 


1 RAH 
X oo u 


cctgttgtaa 


ttgctcctca 


tgtccgtgaa 


acaagtacac 


aggatgtgat 


caacaaagtt 


1920 


ctattttaca 


ggagcatgat 


cctgtcgata 


ccttgccgta 


ggttatgtaa 


catgattgga 


1980 


gcgcaaccag 


ctgttctctt 


gcacagatcg 


agagtgaggg 


gtattttgtg 


acattacaca 


2040 


gcatcaggag 


cctggtgcct 


catcaggtaa 


ctgaagagct 


cagtgtggta 


ggctggcaag 


2100 


tcacccttcc 


cgagacagcc 


caccttcagg 


cccgtgatgt 


gcgcaatcat 


gtacgtgagc 


2160 
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agggcgtagc 


tggcgatgtt 


gaaaggcaca 


agctggcagg 


acagctcact 


gttcaccaca 


agcgccatca 


gaggaagatc 


tcttggattc 


gggttggttt 


tgatggtgtc 


aatcactcct 


tctgattcca 


tacctctgta 


ttctgcccca 


cccaagtccc 


cttcttctct 


ggtggagaat 


ttggcatccc 


agattttcac 


tcccttggaa 


ataaaccaca 


gcaactcctc 


caaaacaccc 


gggaattcat 


ctctcaggct 


gtagcgcgcc 


cccgtgcggt 


cgtccttcct 


gacgccgcag 


tccaggtccc 


agatgaagag 


acaattccac 


taccttaaaa 


aaaaaaaaaa 


ggcagcaatg 


cagtgtattt 


tttcttggag 


taaaagtcca 


tttggaaaaa 


ctggtactgt 


atgtagcaaa 


ttaacaattt 


gaagaaatcg 


tcacagcttc 


ttcctctgat 


gtgataataa 




<210> 64 

<211> 463 

<212> DNA 

<213> Homo sapien 




<400> 64 
tggatcgtgg 


tcgcggcgag 


gtacaggact 


taggtgtact 


acacgcgcct 


gggcaacgac 


atcgccatta 


tccccagcaa 


aaagctccgc 


atgaagcgaa 


ttcagagagg 


cccagtaaga 


agagaaagga 


gagacaatta 


tgttcctgag 


gtagatcctg 


acactaagga 


aatgctgaag 


caggtcactc agcctacagt 


tgggatgaat 


ttttctgtag 


tgctgtatta 


ttttcaataa 



<210> 65 

<211> 5226 

<212> DNA 

<213> Homo sapien 
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ccgaggccca 


tgtctcccga 


tctctggtac 


2220 


tagaactggc 


agagggcatg 


gcatggaggc 


2280 


caagcgcaca 


tgatgattct 


tctgtcgtca 


2340 


tgcagttggt 


caactccctg 


tcctgaataa 


2400 


aaatgcctcc 


actggaagcc 


ataaactggg 


2460 


cccaggctgt 


ccaaaaagtc 


tcgggatcca 


2520 


gacagctctt 


tagcatttgt 


ggatcccttg 


2580 


ttccagaaca 


cacgtttggt 


tgtcagcaga 


2640 


tgcatgccga 


ataccgacag 


ggtgccggtg 


2700 


cggaggatgt 


gttggatctg 


ccccaggatt 


2760 


ttactggata 


taggatgcat 


tagatcttct 


2820 


atcaaaatac 


taataaatta 


ctcacagact 


2880 


ggatgggtaa 


tagaatacct 


gctgttggct 


2940 


ataatgtgaa 


acccatatgc 


atggatattc 


3000 


cctgggttgt 


tgagcctcta 


agatggtctt 


3060 


cCgaactcta 






3100 


agactgggtt 


tatatttaac 


tcttgcctca 


60 


ttccacacga 


acaagcgcgt 


gcgcgaggag 


120 


aacaagatag 


caggttatgt 


cacgcatctg 


180 


ggtatctcca 


tcaagctgca 


ggaggaggag 


240 


gtctcagcct 


tggatcagga gattattgaa 


300 


cttttggact 


tcggcagtct 


gtccaacctt 


360 


ttcaaaacgc 


ctcggggacc 


tgtttgaatt 


420 


atctgggaca 


aca 




463 
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gcggtggcgg 


tggcggccgc 


atgccctggg 


ccatgatctg 


gagggcgccg 


accgcgtcga 


ggctgcttcc 


cggagggtcg 


ccgccgctct 


ccgccaagga 


cccgaggcgg 


ccccgcgcgc 


gagaggcccc 


cgcgcctcgg 


cagccgccga 


cgccgcagcc 


aagctagagg 


agccgcccga 


gtgccgccca 


ccctggtgcc 


ggcctcggcg 


gccgcgctgc 


ctgggctccg 


cgcagcccac 


agcgctgccg 


tggagcacga 


cctgcgcacc 


ggggcccggc 


gcggaaggtg 


cgggcaagag 


accgtgcgcg 


acacgctgct 


aagtttaaga 


aggagccggc 


gccaacgggt 


ctaaagcagg 


gggacggaga 


gcagaggaag 


caacaaacac 


cccacttcct 


tcatgtgctg 


aagcagcata 


ggttttcctt 


tcccagcaca 


ctattggggg 


acattagcag 


tacaaacagg 


cagttacaag 


cctcagaatc 


ccgtacgcct 


agtattttct 


gtctgtgctt 


aggaagagaa 


tgatctcttt 


tttgtgtttc 


cagttgcaag 


gaagtcgggc 


cccctaagat 


gggctcaaga 


tccccatgac 



atcccacreao 


woyu^y uyv,w 






60 




ycici^^tiv^y u 


y^^yu>yyv>uy 


v.y L.u»aGiWrU<3w 


120 


^y ^ ^ » w ^ 


yay a^y y ^y 




y \.yyy (.y Vi.a^ 


180 




yycgccgcgy 


v^^^u^yy^uyv- 


^yL>v^ciciy^cy 


24 0 




^aywoyvvdyw 


ay^oy^»u> ^yy 


ppa poapoop 
^..^o^yyv^y^v.* 


300 


y ^*^y y ^ 


i> ^y y ^y ^y \v u 


oi w wwy c cy y v» 


aoppcipoiapp 
yy^w^^^Mww 


360 










420 


y ^ wy w^*»y ^y 


aa^yyi^oiviv^ 


cyy k.y w^^aa 


POCtPt* t* Pf PP 

W^«j^ ^ W \« b> w 


480 




\^ W ^ 


61 Sv\v%«^ ^y ij y 


wy y w wo ^y 


540 




y y w wy y ^ wei 


v>y ^ v«y (■> i^y 


pa ccacac t* p 
wciw wy w ^ w 


600 


^y u c uoi 


y wu.y ^y y uy l. 


^^yy cici^wy ^ 


yywwy^v^ciyw 


660 


cy ai^i^ uy yy c 




yy Sv^yy s_>a u w 


cnX" a t" poacfp 
^y ^y vwyoyw 


720 


y (..ay uy cyay 


CT floor's cicc^ 


aoaa era a a pa 




780 


^y a^ciy w ^y 


k>^v>pCiw^y\vyy 


^^yyyy v-v-y s» 


pcraapt"Ciaap 


840 


w^-y wy y y u\^u 


yyayagt-ayy 


a r'h' oacf t" pa a 




900 


y y ^-y L.y Vi^o V 


i»ay wciv«yy^v» 


a^u^yyyy 


pt" t" pflaoaop 
w v. L. wy ay ay w 


960 


uyau uy 


yycayy u uy u 


*-3yy ^^y** 


ooppa a caaa 
yy wwaav^yyy 


1020 


t-ayyyycggc 


h rt f" rta a o h fta 
uyuyciayuya 


yyyyy*-*-^«*y 


naaaaaaaacT 
y y y ay aaaay 


1060 


yy tyy tuct t 


i>yyci^u^aww 


at"!" ^t"ar*Pfr* 


ay wv>v»ay aaa 


1140 


yciu%^i.ys»^uy 


ciyy ^y y cici^ w 


ay uyui- ^yy ^ 


yy waawy (^y u 


1200 


d^ciyciyo ^ya 


y uv^oyciv^ ^yy 


ap^^oa ^ ar*cTP 
ywuyo uciivy^ 


t" p t" ^la pa PCTO 
c^L.ya^^ai..yy 


1260 


ut«^uuyyauy 


Ofia fir' a t" rta 
yyciyv»ciuyciy 


no Pa p r^a h p 
yy^a(»^^ciycv» 




X ^ ^ V/ 


w^ciwouy^uy 


ao^ Of a aa 
ciy wy wBcici^y 


aoonhapl" 1" ^ 
oyyy^ci^wwu 


uy ^y wauy ^y 


1380 


wy uyuKi>ciuuu 


u ^oy uyy wuv» 


pat" ^ fr" t"aaa^ 


pa CT ^ p ^ ap ^ a 
v«ay u w uy v> uy 


144 0 


gaaggtttta 


agttgcatgt 


gcacctgaaa 


ctcgtatatg 


1500 


ttagagagga 


ggaattctgt 


aacgactttt 


gtttcgggtt 


1560 


cagtgcaccg 


ccacttatgt 


tacctttttc 


cttttatttc 


1620 


aacagcaagg 


aaaaggaagc 


cctctccaga 


accagaaggt 


1680 


caacggagag 


gcccagccgt 


ggctgtccac 


atccacagag 


1740 


tcctacatcc 


tcttttgtgt 


ctccgccacc 


acccactgcc 


1800 
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tcacctcatt 


ccaaccggac 


cacaccgcct 


gcagccctga 


tcttagtagc 


agacaatgca 


caggttcact 


ccactaccag 


gaggaatagc 


caaagaaggc 


tgggccccag 


agaggtgggg 


gagccagtgc 


accctgccag 


cctcccggac 


tgcaccccct 


gccacgagcg 


gctggaggac 


tcgcacaagt 


tctgcttccc 


ttgctccaga 


gaggtctatt 


gtcccagtgg 


ggaaaaatgc 


tttatgcaag 


gggaaattgc 


aaccatcctt 


gactcgtgac 


ttttccggtt 


tcagaaaaac 


aattgtatat 


atatctccat 


atatatatat 


aacatggctg 


tataattttg 


attttttttg 


gacaaaaggt 


atgtacttat 


aaagacattc 


ttgtgcctca 


ttatcctggt 


gacagttacc 


ttttagagca 


cttggcaaat 


cagaaatgct 


aaagaaaaaa 


aaagccaaat 


acattctctg 


tccttattgc 


tggccccttt 


ctcaggccgg 


atcgaacggg 


catgttgtca 


agtgggcatg 


aacattgtcc 


tcagaggagt 


aggaaagtgg 


agttcctgag 


atactgatga 


ctgagagtgc 


tttggggttt 


tttgtttttt 


tttttttaag 


gatcagttgt 


aagattacaa 


tgctgcatgc 


atggaaggcg 


gttataatgg 


atggtggtgt 


agcccagctc 


tgcagcgcca 


cttcatcttt 


ctgttgtcct 


ttattttgaa 


agagttgcaa 


ttgtgggatt 


ggttttgttt 


ctgttttgtt 


tgttgatact 


atcacctata 


ccctgtttct 


acagtaacag 


ccgcgtggtg 


ctgccaggac 


tgtgggtgga 


ggaaacctgt 


aatttccttc 


atactgttct 


ggaaaaaaat 


gataaactag 
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gaagcggccc agaatggcca 


gtcccccatg 


1860 


gggggcagtc atgcctcaaa 


agatgccaac 


1920 


aacagtccgc cctctccgtc 


ctctatgaac 


1980 


ggccagggag caggcaacac 


aggaggactg 


2040 


tcctctctgg caaccagtgc 


cccgctgtgc 


2100 


acccattttg tgcagtgccc gtccgtccct 


2160 


caaagcatca aacagcaggg 


agctagtgga 


2220 


cctcttgtgg gctccaatgt 


cccctgggcc 


2280 


gctggagatg tgaaagtgaa 


aaaagagaga 


2340 


ccaatgatta cccttaatta 


aaactgcttg 


2400 


atccaagaca agggaaatgt 


agacttcata 


2460 


aatacattgt gtttctatat 


tttttttgac 


2520 


ttttcctttg ttaacgttat 


tagcatatct 


2580 


gttctatgta ggctgtgact 


tgcgctgctt 


2640 


tctagctgta tttgtatgca 


cttattttaa 


2700 


acattgtaag attgccttac 


tgtctgtcat 


2760 


aggccaagtg gtggagaagg aaaggaaatg 


2820 


ccactgggaa ataccaccag 


tttaccctga 


2880 


attttgaatc tctattttgc 


tcaaaagttc 


2940 


tgctgggaaa ttttcaggat 


tgtgtggtct 


3000 


acaaagttga ccgctgctca 


ctgtccacgt 


3060 


tagttggtta cataagatac 


aattccagcg 


3120 


gtacaagatg gcactgccat 


ctttgagcag 


3180 


ttaaacaccc tagaggtctg tttgttgttg 


3240 


gagaagttac agtccaggtg aacttggaga 


3300 


ttgtttatca tttacctgta gtgctattgc 


3360 


agtgagtgct gaatacagta 


tggtacaatg 


3420 


tgcccttggg catatcagtg 


acagcccaaa 


3480 


ttaacatgtg tttgaaatac 


caagtgaata 


3540 


tggaaattaa agaaattaag ggttttatat 


3600 
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aatagacagg ccccacctct 
tgatcactat tttgcatcag 
gaaatagttg tctcacagtg 
caggaacact attttaaatc 
tgaattatgg cttcaaggtc 
catttgaaaa tacagattct 
cctttttaaa ttgttcaaaa 
atactgcacc aaatacagtg 
gtgcgagaat tcatgttcta 
cagttgttac acttactttt 
cttatgtgaa tgcacacatg 
cgcaaagaat tcagctctta 
actggaattt gacctcacag 
gctatctgta tgttttgcaa 
ccatcctgcc gttttggata 
gtaccggtgt ttaatggggg 
ctactattaa aaaataagat 
ggctcgtagc tggaaagtgt 
acatcgggac atgggtttga 
gctggataaa aatgtcacaa 
taatatttca gccgtgcttt 
ccttttttgc ttttaacaat 
acacttaaag ttatgagaac 
gccaccaagt attttaaaag 
gatacttttt ctatatatga 
caacattatt atattttaaa 
agatctacag tacctttttt 
actgaa 

<210> 66 

<211> 5045 

<212> DNA 

<213> Homo sapien 



PCT/US2003/040063 

91 



caaaatattt 


ttagaagtct 


ttttgtaaac 


taatttcttt 


3660 


taaaatgatt 


cccttaaaac 


caataaatca 


tcaattatta 


3720 


atactggttt 


ttcttttgtg 


ctgttatgat 


ttaacattga 


3780 


cttacgttca 


ggtgtttgta 


acttggcctt 


ataattaggc 


3840 


tacaatttat 


gtgtatggtc 


cacagcctag 


cttctattta 


3900 


taccaacttt 


ggattctttt 


ttagtcatat 


gtttgtcttt 


3960 


ccatttttta 


atggtcaagt 


tactaacact 


tgaaaatcag 


4020 


tttttccgta 


gtgtttttaa 


tgagcgcacc 


tattactact 


4080 


ccagtcattg 


ttatattaca 


aacagacttg 


catgattaac 


4140 


tcaagttgga 


gtatatatga 


ctcagtgcag 


actggtctct 


4200 


cagaaatgca 


gagtcaattt 


tacatgccca 


taaagacatt 


4260 


tggtctgttg 


tataaatgtg 


tatctaggca 


ctttattata 


4320 


atgttacaac 


ttgatcagtc 


gtttgaccta 


atttgtggta 


4380 


tcttaataca 


gacatgcttt 


ccaaaaagat 


caatacagaa 


4440 


agtctatcca 


gctgtggaaa 


gggcaacctg 


tggtttctct 


4500 


aagaatatga 


acagctttaa 


agagctgtgt 


attgtggtta 


4560 


ctgcacgagt 


ctgactggcc 


tttgggtggc 


ctttgtggac 


4620 


tgatctgggt 


tttctggcat 


tcttttaagt 


taaaaagtta 


4680 


tcttttgttg 


tacctgatga 


cagtgcagag 


attctccaca 


4740 


agctacttac 


tgtacatggg 


cagtatcaga 


tttcaaatcc 


4800 


taatactcaa 


aatattaggg 


gatggggtgt 


tgaagctttc 


4860 


tcatagaatt 


taacagatgt 


actgtctttc 


atgtggcctc 


4920 


atacacatgg 


tttacaactt 


ttactatata 


cctttccttg 


4980 


tgtgccacct 


tttaaccttt 


acttttttta 


agttgaaggt 


5040 


tgaaactcat 


gtcaactgaa 


gtgagtgtaa 


tcccagatat 


5100 


atcacgctat 


ggaaatatca 


cctgaattct 


gtcatttgtc 


5160 


tctttaactt 


ttagcatcaa 


ataaaaataa 


aattgggagc 


5220 



5226 
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<:400> 66 

gcggtggcgg 


cggcggccgc 


gtcccggcgg 


atgccctggg 


ccatgatctg 


ggacttcacc 


gagggcgccg 


accgcgtcga 


gctcgtcatc 


ggctgcttcc 


cggagggtcg 


ctccccaccc 


ccgccgctct 


ccgccaagga 


catccttttg 


cccgaggcgg 


ccccgcgcgc 


gccgcaggcc 


gagaggcccc 


cgcgcctcgg 


ctctgacttc 


cagccgccga 


cgccgcagcc 


gccgcccgtg 


aagctagagg 


agccgcccga 


gctgaatcgc 


gtgccgccca 


ccctggtgcc 


gctcatgaac 


ggcctcggcg 


gccgcgctgc 


cgcctcctta 


ctgggctccg 


cgcagcccac 


cgatctgggc 


agcgctgccg 


tggagcacga 


gcagcgtgag 


cctgcgcacc 


ggggcccggc 


cgacagcctg 


gcggaaggtg 


cgggcaagag 


ccgcgggtct 


accgtgcgcg 


acacgctgct 


ggcgctgcac 


aagtttaaga 


aggagccggc 


cctgactgca 


gccaacgggt 


ctaaagcagg 


taggggcggc 


gggacggaga 


gcagaggaag 


ggtggttctt 


caacaaacac 


cccacttcct 


gatctgcctg 


tcatgtgctg 


aagcagcata 


acagagatga 


ggttttcctt 


tcccagcaca 


ttcttggatg 


ctattggggg 


acattagcag 


tcacatgttg 


tacaaacagg 


cagttacaag 


cgtgtcattt 


cctcagaatc 


ccttgcaaga 


acagcaagga 


aagtcgggcc 


ccctaagatc 


aacggagagg 


ggctcaagat 


ccccatgact 


cctacatcct 


cacctcattc 


caaccggacc 


acaccgcctg 


cagccctgat 


cttagtagca 


gacaatgcag 


aggttcactc 


cactaccagg 


aggaatagca 
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cagtcgtgct 


acctgcgtga 


cctgccccgc 


60 


gaacccgtct 


gccgcggctg 


cgtcaactac 


120 


gagacggcgc 


ggcagctcaa 


gcgggcgcac 


180 


qgcqccgcQQ 


cctcggccgc 


cgccaagccg 


240 


cagcagcagc 


agcagcttgg 


ccacggcggc 


300 


ttagagcQct 


acccgctggc 


ggccgcggcc 


360 


ggcagcagcc 


gcccggcagc 


gagcctggcc 


420 


aacggcatcc 


tggtgcccaa 


cggcttcccc 


480 


cagagcccga 


acccgcggcg 


cggccacgcg 


540 


ggctcggcca 


cgccgctgcc 


caccgcgctc 


600 


gccgcggtgt 


ccggaaccgc 


ggccgccagc 


660 


gcccacaagc 


ggccggcatc 


cgtgtcgagc 


720 


gcggcagcca 


aggagaaaca 


accgccgccg 


780 


tccaccgcgg 


ccggggccgc 


cgagctgagc 


840 


ggagagcagg 


actgggtcaa 


caggcccaag 


900 


cagcacggcc 


actcggggcc 


cttcgagagc 


960 


ggcaggttgt 


tgggtttcga 


ggccaacggg 


1020 


tgtgaagtga 


gggggtctag 


gggagaaaag 


1080 


Cggattcacc 


attttacccc 


agcccagaaa 


1140 


aggcggaacc 


agtgcttggt 


ggcaacgtgt 


1200 


gtcagactgg 


gctgatacgc 


tctgacacgg 


1260 


ggagcatgag ggcaccagtc 


accttttaac 


1320 


agtgcaaacg 


agggcacttt 


tgtgcatgtg 


1380 


tcagtggctc 


cattttaaat 


cagtctgctg 


1440 


aaaggaagcc 


ctctccagaa 


ccagaaggtg 


1500 


cccagccgtg 


gctgtccaca 


tccacagagg 


1560 


cttttgtgtc 


tccgccacca 


cccactgcct 


1620 


aagcggccca 


gaatggccag 


tcccccatgg 


1680 


ggggcagtca 


tgcctcaaaa 


gatgccaacc 


1740 


acagtccgcc 


ctctccgtcc 


tctatgaacc 


1800 
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aaagaaggct gggccccaga gaggtggggg 
agccagtgca ccctgccagc ctcccggact 
gcaccctctg ccacgagcgg ctggaggaca 
cgcacaagtt ctgcttccct tgctccagac 
aggcctattg tcccagtggg gaaaaatgcc 
ttatgcaagg ggaaattgca accatccttg 
actcgcgact tttccggttt cagaaaaacc 
attgtacata tatctccata tatatatata 
acatggctgt ataattttga ttttttttga 
acaaaaggta cgtacttata aagacatttt 
tgtgctttat tatcctggtg acagttaccg 
ttcagagcac tcggcaaatc agaaatgctt 
aagaaaaaaa aagccaaata cattttctga 
ccttattgct ggcccctttc tcaggccgga 
tcgaacgggc atgttgtcaa gtgggcatgc 
acattgtcct cagaggagta ggaaagtgga 
gttcctgaga tactgatgac tgagagtgct 
ttggggtttt ttgttttttt ttttttaaga 
atcagttgta agattacaat gctgcatgct 
tggaaggcgg ttataatgga tggtggtgtg 
gcccagctct gcagcgccac ttcatctttt 
tgttgtcctt tattttgaaa gagttgcaag 
tgtgggattg gttttgtttc tgttttgttt 
gttgatacta tcacctatac cctgtttcta 
cagtaacagc cgcgtggtgc tgccaggact 
gtgggtggag gaaacctgta atttccttct 
tactgttctg gaaaaaaatg ataaactagt 
atagacaggc cccacctctc aaaatatttt 
gatcactatt ttgcatcagt aaaatgattt 
aaatagttgt ctcacagtga tactggtttt 
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gccagggagc 


aggcaacaca ggaggactgg 


1860 


cctctctggc aaccagtgcc ccgctgtgct 


1920 


r^r^n^ 1* t* f ^crf 
CC^ci 1. 1,. u u 






1980 






actaataaaa 


2040 


ctcttgtggg 


ctccaatgtc 


ccctgggcct 


2100 


ctggagatgt 


gaaagtgaaa 


aaagagagag 


2160 


caatgattac 


ccttaattaa 


aactgcttga 


2220 


tccaagacaa 


gggaaatgta 


gacttcataa 


2280 


atacattgtg 


tttctatatt 


ttttttgacg 


2340 


tttcttttgt 


taacgttatt 


agcatatctt 


2400 


ttctatgtag gctgtgactt gcgctgcttt 


2460 


ctagctgtat 


ttgtatgcac 


ttattttaaa 


2520 


cattgtaaga 


ttgccttact 


gtctgtcatt 


2580 


ggccaagtgg 


tggagaagga 


aaggaaatga 


2640 


cactgggaaa 


taccaccagt 


ttaccctgaa 


2700 


ttttgaatcc 


ctattttgct 


caaaagttca 


2760 


gctgggaaat 


tttcaggatt 


gtgtggtctt 


2820 


caaagttgac 


cgctgttcac 


tgtccacgtg 


2880 


agttggttac 


ataagataca 


attccagtga 


2940 


tacaagatgg 


cactgccatc 


cttgagcaga 


3000 


taaacaccct 


agaggtctgt 


ttgtcgttgc 


3060 


agaagtcaca 


gtccaggtga 


acttggagat 


3120 


tgttcatcat 


ctacctgtag 


tgctattgct 


3180 


gtgagtgctg 


aatacagtat 


ggtacaacga 


3240 


gcccttgggc 


atatcagtga 


cagcccaaac 


3300 


taacatgtgt 


ttgaaatacc 


aagtgaataa 


3360 


ggaaattaaa gaaattaagg gttttatata 


3420 


tagaagtctt 


tttgtaaact 


aatttctttt 


3480 


ttttaaaacc 


aataaatcat 


caattattag 


3540 


tcttttgtgc 


tgttatgatt 


taacattgac 


3600 
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aggaacacta ttttaaaccc ttacgttcag 
gaattatggc ttcaaggtct acaatttatg 
atttgaaaat acagatttcc accaacttcg 
ctttctaaat tgttcaaaac tattttttaa 
tactgcacca aatacagtgt ttttccgcag 
tgcgagaatt catgttttac cagccattgt 
agttgttaca cttacttttt caagttggag 
ttatgtgaat gcacacatgc agaaatgcag 
gtaaagaatt cagctcttat ggcctgttgt 
ctggaatttg acctcacaga tgttacaact 
ctatctgtat gttttgcaat cttaatacag 
catcctgccg ttttggataa gtctatccag 
tactggtgtt taatggggga agaatatgaa 
tactattaaa aaataagatc tgcacgagtc 
gctcgtagct ggaaagtgtt gatctgggtt 
catcgggaca tgggtttgat cttttgttgt 
ctggataaaa atgccacaaa gctactcact 
aatatttcag ctgtgctttt aatactcaaa 
cttttttgct tttaacaatt tatagaattt 
cacttaaagt tatgagaaca tacacatggt 
ccaccaagta ttttaaaagt gtgccacctt 
atacttcttc tatatatgac gaaactcatg 
aacattatta tattttaaaa tcacgctatg 
gatttacagt accttttttc ctttaacttt 
ctgaa 

<210> 67 

<211> 4956 

<212> DNA 

<213> Homo sapien 

<400> 67 

gcggtggcgg tggcggccgc gtcccggcgg 
atgccctggg ccatgatctg ggacttcacc 
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gcgcccgcaa 


cu uyyccu ua 


L>aa u uayy WW 


3660 


t.y tatyy etc 


d^oyu^UvUciyu- 


wwwwaww*»»*»ii» 


3720 


r^skl*t*/^l' ir t' tr 
ga t uCu C uC U 


udy u uduauy 


1 1 ^a^^?tccc 
uuuywwufc«ww 


3780 


i>yy t^ociy L. t 


a^t*aaoaf*t* ^ 


rtaaaatcaQa 


3840 


^ ^ ^ t' ^aah 
ugCuuCCoaL 


y ay uyu>acv»u 


fit t fiat acta 


3900 


cacaccacoa 


cicayacu uyC 


CiUyciU.uaaww 


3960 


f" a a a ftsi 
L a L. a u a I. ^CL C 


uu.oyuyU'ayci 


wuyyuwL.wk>w 


4020 


agccaacccL 


oCauyu.u,Ca.u 


oaayo^o^ ^ 


4080 




au^uayy^au. 


1 1 r a tta taa 


4140 


uyou^ciywu>y 


WW i>yawwi»aa 


tttataataa 

w u uy wyy way 


4200 


oi^ciuy^uuuu> 


waaaaay a i« \m 


aatiacagaac 


4260 


v uy uyy dociy 


y y WW wy w 




4320 


Cay v> U u Uaaa 


yaywuyuyua 


w wy wyywwow 


4380 


uya^ uy y^u. u 


1 1" aaat aaf*c^ 
u uyyy uyyww 


w w wy wyys* wy 


4440 


u uu.uyyu.au u 


w w u u uaay u w 


aaaaayww 


4500 


aUiV^ uyouyaw 






4560 


gcacaugggc 


ay Ua u wayci u 


t- 1 r^st^at cat 
uuwadau^wu 


4620 


auaucagygg 


atyyyy uy uu 


ans^aattt a a 
y doy w u u u w w 


4680 


aaCay a uy ua 


^tat at 1 1 a Si 
wwyuwuu uwo 


t at aacct.cA 

wy wyy WW w wm 


4740 


u uocaacu u u 


woiwuaUauaw 


at 1 1 aat t aa 
wuuuwwuwyy 


4800 


Ctiaaccctita 


ccuuLu wuaa 


/V ^ ^ /va a aa r a 

gccgaagguy 


4860 


tcaactgaag 


tgagtgtaat 


ctcagacatc 


4920 


gaaatatcac 


ctgaattctg 


tcatttgtca 


4980 


tagcattaaa 


taaaaataaa 


attgggagca 


5040 








5045 


cagtcgtgct 


acctgtgtga 


cctgccccgc 


60 


gaacccgtct 


gccgcggctg 


cgtcaactac 


120 
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gagggcgccg 


accgcgccga 


gttcgtcatc 


gagacggcgc 


ggcagctcaa 


gcgggcgcac 


180 


ggctgcttcc 


cggagggtcg 


ctccccaccc 


ggcgccgcgg 


cctcggccgc 


cgccaagccg 


240 


ccgccgctct 


ccgccaagga 


catccttttg 


cagcagcagc 


agcagcttgg 


ccacggcggc 


300 


cccgaggcgg 


ccccgcgcgc 


gccgcaggcc 


ttggagcgct 


acccgttggc 


ggccgcggcc 


360 


gagaggcccc 


cgcgcctcgg 


ctctgacttc 


ggcagcagcc 


gcccggcagc 


gagcctggcc 


420 


cagccgccga 


cgccgcagcc 


gccgcccgtg 


aacggcatcc 


tggtgcccaa 


cggcttctcc 


480 


aagctagagg 


agccgcccga 


gctgaatcgc 


cagagcccga 


acccgcggcg 


cggccacgcg 


540 


gtgccgccca 


ccctggtgcc 


gctcatgaac 


ggctcggcca 


cgccgctgcc 


caccgcgctc 


600 


ggcctcggcg 


gccgcgctgc 


cgcctcctta 


gccgcggtgt 


ccggaaccgc 


ggccgccagc 


660 


ctgggctccg 


cgcagcccac 


cgatctgggc 


gcccacaagc 


ggccggcatc 


cgtgtcgagc 


720 


agcgctgccg 


tggagcacga 


gcagcgtgag 


gcggcagcca 


aggagaaaca 


accgccgccg 


780 


cctgcgcacc 


ggggcccggc 


cgacagcctg 


tccaccgcgg 


ccggggccgc 


cgagctgagc 


840 


gcggaaggtg 


cgggcaagag 


ccgcgggtct 


ggagagcagg 


actgggtcaa 


caggcccaag 


900 


accgtgcgcg 


acacgctgct 


ggcgctgcac 


cagcacggcc 


actcggggcc 


cttcgagagc 


960 


aagtttaaga 


aggagccggc 


cctgactgca 


ggcaggttgt 


tgggttccga 


ggccaacggg 


1020 


gccaacgggt 


ctaaagcagg 


taggggcggc 


tgtgaagtga 


gggggtctag 


gggagaaaag 


1080 


gggacggaga 


gcagaggaag 


ggtggttctt 


tggattcacc 


attttacccc 


agcccagaaa 


1140 


caacaaacac 


cccacttcct 


gatctgcctg 


aggcggaacc 


agtgcttggt 


ggcaacgtgt 


1200 


tcatgtgctg 


aagcagcata 


acagagatga 


gtcagactgg 


gccgatacgc 


tctgacacgg 


1260 


ggttttcctt 


tcccagcaca 


ttcttggatg 


ggagcatgag 


ggcaccagtc 


accttttaac 


1320 


ctattggggg 


acattagcag 


tcacatgttg 


agtgcaaacg 


aggttgcaag 


aacagcaagg 


1380 


aaaaggaagc 


cctctccaga 


accagaaggt 


gaagtcgggc 


cccctaagat 


caacggagag 


1440 


gcccagccgt 


ggctgtccac 


atccacagag 


gggctcaaga 


tccccatgac 


tcctacatcc 


1500 


tcttttgtgt 


ctccgccacc 


acccactgcc 


tcacctcatt 


ccaaccggac 


cacaccgcct 


1560 


gaagcggccc 


agaatggcca 


gtcccccatg 


gcagccctga 


tcttagtagc 


agacaatgca 


1620 


gggggcagtc 


atgcctcaaa 


agatgccaac 


caggttcact 


ccactaccag 


gaggaatagc 


1680 


aacagtccgc 


cctctccgtc 


ctccatgaac 


caaagaaggc 


tgggccccag 


agaggtgggg 


1740 


ggccagggag 


caggcaacac 


aggaggactg 


gagccagtgc 


accctgccag 


cctcccggac 


1800 


tcctctctgg 


caaccagtgc 


cccgctgtgc 


tgcaccctct 


gccacgagcg 


gctggaggac 


1860 


acccattttg 


tgcagtgccc 


gtccgtccct 


ccgcacaagt 


tctgcttccc 


ttgctccaga 


1920 


caaagcatca 


aacagcaggg 


agctagtgga 


gaggtctatt 


gtcccagtgg 


ggaaaaatgc 


1980 



wo 2004/050860 PCT/US2003/040063 

96 



cctcttgtgg 


gctccaatgt 


cccctgggcc 


tttatgcaag 


gggaaattgc 


aaccatcctt 


2040 


gctggagatg 


tgaaagtgaa 


aaaagagaga 


gactcgtgac 


ctttccggtt 


tcagaaaaac 


2100 


ccaatgatta 


cccttaatta 


aaaccgcttg aattgtatat 


atatctccat 


atatatatat 


2160 


acccaagaca 


agggaaatgt 


agacctcata 


aacatggctg 


tataattttg 


attttttttg 


2220 


aatacattgt 


gtttctatat 


tttctttgac 


gacaaaaggt 


atgtacttat 


aaagacactt 


2280 


ttttcttttg 


ttaacgttat 


tagcatatct 


tcgtgcttta 


ttatcctggc 


gacagttacc 


2340 


gttctatgta 


ggctgtgact 


tgcgctgctt 


ttttagagca 


cttggcaaat 


cagaaatgcc 


2400 


tctagctgta 


tttgtatgca 


cttattttaa 


aaagaaaaaa 


aaagccaaat 


acattttctg 


2460 


acattgtaag 


attgccttac 


tgtctgtcat 


tccttattgc 


tggccccttt 


ctcaggccgg 


2520 


aggccaagtg 


gtggagaagg 


aaaggaaatg 


atcgaacggg 


catgttgtca 


agtgggcatg 


2580 


ccaccgggaa 


ataccaccag 


tttaccctga 


aacattgtcc 


tcagaggagt 


aggaaagtgg 


2640 


attttgaatc 


tctattttgc 


tcaaaagctc 


agttcctgag 


atactgatga 


ctgagagtgc 


2700 


tgctgggaaa 


ttttcaggat 


tgtgtggtct 


tttggggttt 


tttgtttttt 


tttttttaag 


2760 


acaaagttga 


ccgctgttca 


ctgtccacgt 


gatcagttgt 


aagattacaa 


tgctgcatgc 


2820 


tagttggtta 


cataagatac 


aattccagtg 


atggaaggcg 


gttataatgg 


atggtggtgt 


2880 


gtacaagatg 


gcactgccat 


ctttgagcag 


agcccagctc 


tgcagcgcca 


cttcatcttt 


2940 


ttaaacaccc 


tagaggtctg 


tttgttgttg ctgttgtcct 


ttattttgaa 


agagttgcaa 


3000 


gagaagttac 


agtccaggtg 


aacttggaga 


ctgtgggatt 


ggttttgttt 


ctgttttgtt 


3060 


ttgtccatca 


tttacctgta 


gtgctattgc 


tgttgatact 


atcacctata 


ccctgtttct 


3120 


agtgagtgct 


gaatacagta 


tggtacaatg acagtaacag 


ccgcgtggtg 


ctgccaggac 


3X80 


tgcccttggg 


catatcagtg 


acagcccaaa 


tgtgggtgga 


ggaaacctgt 


aatttccttc 


3240 


ttaacatgtg 


tttgaaatac 


caagtgaata 


atactgttct 


ggaaaaaaat 


gataaactag 


3300 


tggaaactaa 


agaaattaag ggttttatat 


aacagacagg 


ccccacctcc 


caaaataCtt 


3360 


ttagaagtct 


ttttgtaaac 


taattccttt 


tgatcactat 


tttgcatcag 


taaaacgatt 


3420 


tttttaaaac 


caataaatca 


tcaattatta 


gaaatagttg 


tctcacagtg 


atactggttt 


3480 


ttcttttgtg 


ctgttatgat 


ttaacattga 


caggaacact 


attttaaatc 


cttacgttca 


3540 


ggtgtttgta 


acttggcctt 


ataattaggc 


tgaattatgg 


cttcaaggtc 


tacaatttat 


3600 


gtgtatggtt 


cacagcctag 


cttctattta 


catttgaaaa 


tacagatttt 


taccaacttt 


3660 


ggattctttt 


ttagttatat 


gtttgtcttt 


cctttttaaa 


ttgttcaaaa 


ctatttttta 


3720 


atggtcaagt 


tactaacact 


tgaaaatcag atactgcacc aaatacagtg tttttccgta 


3780 
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gtgtttttaa 


tgagtgcacc 


tattactact 


ttatattaca 


aacagacttg 


catgattaac 


gtatatatga 


ctcagtgcag 


actggtctct 


gagtcaattt 


tacatgccca 


taaagacatt 


tataaatgtg 


tatctaggca 


ctttattata 


ttgatcagtc 


gtttgaccta 


atttgtggta 


gacatgcttt 


ccaaaaagat 


taatacagaa 


gctgtggaaa 


gggcaacctg 


tggtttctct 


acagctttaa 


agagctgtgt 


attgtggtta 


ctgactggcc 


tttgggtggc 


ctttgtggac 


tttctggcat 


tcttttaagt 


taaaaagtta 


tacctgatga 


cagtgcagag 


attctccaca 


tgtacatggg cagtatcaga 


tttcaaatcc 


aatattaggg gatggggtgt 


tgaagctttc 


taacagatgt 


actgtctttc 


atgtggcctc 


tttacaactt 


ttactatata 


cctttccttg 


tttaaccttt 


acttttttca 


agttgaaggt 


gtcaactgaa gtgagtgtaa 


tctcagatat 


ggaaatatca 


cccgaattct 


gtcatttgtc 


ttagcattaa 


ataaaaataa 


aattgggagc 


<210> 68 
<211> 4442 
<212> DNA 
<213> Homo sapien 




<400> 68 
gcggtggcgg 


tggcggccgc 


gtcccggcgg 


atgccccggg ccatgatctg 


ggacttcacc 


gagggcgccg 


accgcgtcga 




ggctgcttcc 


cggagggtcg 


ctccccaccc 


ccgccgctct 


ccgccaagga 


catccttttg 


cccgaggcgg 


ccccgcgcgc 


gccgcaggcc 


gagaggcccc 


cgcgcctcgg 


ctctgacttc 
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gtgcgagaat tcatgtctta ccagtcattg 3840 

cagttgttac acttactttt tcaagttgga 3900 

cttatgtgaa tgcacacatg cagaaatgca 3960 

tgtaaagaat tcagctctta tggtctgttg 4020 

actggaattt gacctcacag acgttacaac 4080 

gctatctgta tgttttgcaa tcttaataca 4140 

ccatcctgcc gttttggata agtctatcca 4200 

gtactggtgt ttaatggggg aagaatatga 4260 

ctactattaa aaaataagat ctgcacgagt 4320 

ggctcgtagc tggaaagtgt tgatctgggt 4380 

acatcgggac atgggtttga tcttttgttg 4440 

gctggataaa aatgtcacaa agctacttac 4500 

taatatttca gctgtgcttt taatactcaa 4560 

ccttttttgc ttttaacaat ttatagaatt 4620 

acatttaaag ttatgagaac atacacatgg 4680 

gccaccaagt attttaaaag tgtgccacct 474 0 

gatacttttt ctatatatga tgaaactcat 4800 

caacattatt atattttaaa atcacgctat 4860 

agatttacag tacctttttt tctttaactt 4920 

actgaa 4956 



cagtcgtgct acctgtgtga cctgccccgc 60 

gaacccgtct gccgcggctg cgtcaactac 120 

gagacggcgc ggcagctcaa gcgggcgcac 180 

ggcgccgcgg cctcggccgc cgccaagccg 240 

cagcagcagc agcagcttgg ccacggcggc 300 

ttggagcgct acccgttggc ggccgcggcc 360 

ggcagcagcc gcccggcagc gagcctggcc 420 
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cagccgccga cgccgcagcc gccgcccgtg 
aagctagagg agccgcccga gctgaatcgc 
gtgccgccca ccctggtgcc gctcatgaac 
ggcctcggcg gccgcgctgc cgcctcctta 
ctgggctccg cgcagcccac cgatctgggc 
agcgctgccg tggagcacga gcagcgtgag 
cctgcgcacc ggggcccggc cgacagcctg 
gcggaaggtg cgggcaagag ccgcgggtct 
accgtgcgcg acacgctgct ggcgctgcac 
aagtttaaga aggagccggc cctgactgca 
tctccagaac cagaaggtga agtcgggccc 
ctgtccacat ccacagaggg gctcaagatc 
ccgccaccac ccactgcctc acctcattcc 
aatggccagt cccccatggc agccctgatc 
gccccaaaag atgccaacca ggttcatcct 
ccctctgcca cgagcggctg gaggacaccc 
acaagttctg cttccctcgc tccagacaaa 
tctattgtcc cagtggggaa aaatgccctc 
tgcaagggga aattgcaacc atccttgctg 
cgtgactttt ccggtttcag aaaaacccaa 
gtatatatat ctccatatat atatatatcc 
tggctgtata attttgattt tttttgaata 
aaaggtatgt acttataaag acattttttt 
gctttattat cctggtgaca gttaccgttc 
agagcacttg gcaaaccaga aatgcttcta 
aaaaaaaaag ccaaatacat tttctgacat 
tattgctggc ccctttctca ggccggaggc 
aacgggcacg ttgtcaagtg ggcatgccac 
ttgtcctcag aggagtagga aagtggattt 
cctgagatac tgatgactga gagtgctgct 
gggttttttg ttttttttct tttaagacaa 
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aacggcatcc 


tggtgcccaa 


cggcttctcc 


480 


cagagcccga 


acccgcggcg 


cggccacgcg 


540 


ggctcggcca 


cgccgctgcc 


caccgcgctc 


600 


gccgcggtgt 


ccggaaccgc 


ggccgccagc 


660 


gcccacaagc ggccggcatc cgtgtcgagc 


720 


gcggcagcca 


aggagaaaca 


accgccgccg 


780 


tccaccgcgg 


ccggggccgc 


cgagctgagc 


840 


ggagagcagg 


actgggtcaa 


caggcccaag 


900 


cagcacggcc 


actcggggcc 


cttcgagagc 


960 


gttgcaagaa 


cagcaaggaa 


aaggaagccc 


1020 


cctaagatca 


acggagaggc 


ccagccgtgg 


1080 


cccatgactc 


ctacatcctc 


ttttgtgtct 


1140 


aaccggacca 


caccgcctga 


agcggcccag 


1200 


ttagtagcag acaatgcagg gggcagtcat 


1260 


ctctggcaac 


cagtgccccg 


ctgtgctgca 


1320 


attttgtgca 


gtgcccgtcc 


gtcccttcgc 


1380 


gcatcaaaca 


gcagggagct 


agtggagagg 


1440 


ttgtgggctc 


caatgtcccc 


tgggccttta 


1500 


gagatgtgaa agtgaaaaaa gagagagact 


1560 


tgattaccct 


taattaaaac 


tgcttgaatt 


1620 


aagacaaggg 


aaacgtagac 


ttcataaaca 


1680 


cattgtgttt 


ctatattttt 


tttgacgaca 


1740 


cttttgttaa 


• cgttattagc 


atatctttgt 


1800 


tatgtaggct 


gtgacttgcg 


ctgctttttt 


1860 


gctgtatttg 


tatgcactta 


ttttaaaaag 


1920 


tgtaagattg 


ccttactgtc 


tgtcattcct 


1980 


caagtggtgg 


agaaggaaag 


gaaatgatcg 


2040 


tgggaaatac 


caccagttta 


ccctgaaaca 


2100 


tgaatctcta 


tctcgctcaa 


aagttcagtt 


2160 


gggaaatttt 


caggattgtg 


tggtcttttg 


2220 


agttgaccgc 


tgttcactgt 


ccacgtgatc 


2280 
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agttgtaaga 


ttacaacgct 


gcatgctagt 


tggtcacaca 


agatacaatt 


ccagtgatgg 


2340 


aaggcggtta 


taatggatgg 


tggtgtgtac 


aagatggcac 


tgccatcttt 


gagcagagcc 


2400 


cagctctgca 


gcgccacttc 


atctttttaa 


acaccctaga 


ggtctgtttg 


ttgttgctgt 


2460 


tgtcctctat 


tttgaaagag 


ttgcaagaga 


agttacagtc 


caggtgaact 


tggagattgt 


2520 


gggattggtt 


ctgtttctgt 


tctgttttgt 


ttatcattta 


cctgtagtgc 


tattgctgtt 


2580 


gatactatca 


cctataccct 


gtttctagtg 


agtgctgaat 


acagtatggt 


acaatgacag 


2640 


taacagccgc 


gtggtgctgc 


caggactgcc 


cttgggcata 


tcagtgacag 


cccaaacgtg 


2700 


ggtggaggaa 


acctgtaatt 


tccttcttaa 


catgtgtttg 


aaataccaag 


tgaataatac 


2760 


tgttctggaa 


aaaaatgata 


aactagtgga 


aattaaagaa 


attaagggtt 


ttatataata 


2820 


gacaggcccc 


acctctcaaa 


atatttttag 


aagtcttttt 


gtaaactaat 


ttcttttgat 


2880 


cactattttg 


catcagtaaa 


atgatttttt 


taaaaccaat 


aaatcatcaa 


ttattagaaa 


2940 


tagttgtctc 


acagtgatac 


tggtttttct 


tttgtgctgt 


tatgatttaa 


cattgacagg 


3000 


aacactattt 


taaatcctta 


cgttcaggtg 


tttgtaactt 


ggccttataa 


ttaggctgaa 


3060 


ttacggcttc 


aaggtccaca 


atttatgtgt 


atggttcaca 


gcctagcttc 


tatttacatt 


3120 


tgaaaataca 


gatttttacc 


aactttggat 


tcttttttag 


ttatatgttt 


gtctttcctt 


3180 


tttaaattgt 


tcaaaaccat 


tttttaatgg 


tcaagttact 


aacacttgaa 


aatcagatac 


3240 


tgcaccaaat 


acagtgtttt 


tccgtagtgt 


ttttaatgag 


tgcacctatt 


actactgtgc 


3300 


gagaattcat 


gttttaccag 


tcattgttat 


attacaaaca 


gacttgcacg 


attaaccagt 


3360 


tgttacactt 


actttttcaa 


gttggagtat 


atatgactca 


gtgcagactg 


gtctctctta 


3420 


tgtgaatgca 


cacatgcaga 


aatgcagagt 


caattttaca 


tgcccataaa 


gacatttgta 


3480 


aagaattcag 


ctcttatggt 


ctgttgtata 


aatgtgtatc 


taggcacttt 


attataactg 


3540 


gaatttgacc 


tcacagatgt 


tacaacttga 


tcagtcgttt 


gacctaattc 


gtggtagcta 


3600 


tctgtatgtt 


ttgcaatctt 


aatacagaca 


tgctttccaa 


aaagattaat 


acagaaccat 


3660 


cctgccgttt 


tggacaagtc 


tatccagctg 


tggaaagggc 


aacctgtggt 


ttctctgtac 


3720 


tggtgtttaa 


tgggggaaga 


atatgaacag 


cttcaaagag 


ctgtgtattg 


tggttactac 


3780 


tattaaaaaa 


taagatctgc 


acgagtctga 


ctggcctttg 


ggtggccttc 


gtggacggct 


3840 


cgtagctgga 


aagtgctgat 


ctgggttttc 


tggcattctt 


ttaagctaaa 


aagttaacat 


3900 


cgggacatgg 


gtttgatctt 


ttgttgtacc 


tgatgacagt 


gcagagattc 


tccacagctg 


3960 


gataaaaatg 


ccacaaagct 


acttactgta 


catgggcagt 


atcagatttc 


aaatcctaat 


4020 


atttcagctg 


tgctcttaat 


actcaaaata 


ttaggggatg 


gggtgtcgaa 


gctttccctt 


4080 
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ttttgctttt aacaatttat agaatttaac 
ttaaagttat gagaacatac acatggttta 
ccaagcattt taaaagtgtg ccacctttta 
ctttttctat atatgatgaa actcatgtca 
attattatat tttaaaatca cgctatggaa 
ttacagtacc tttttttctt taacttttag 
aa 

<210> 69 

<211> 2056 

<2i2> DNA 

<213> Homo sapien 



<400> 69 
cccgcccgcg 


cggtcgtcgg 


agcgcggctc 


ggccaggcgc 


aggcgggcgg 


gcggagcagc 


gctcgcggcg 


gcgcagcgcg 


ccggagtggt 


gggcagctcg 


cacttgctca 


acaagggcct 


gcttcagtga 


ttgccgcgga 


tcagaagcct 


tgggatgagg 


ctccgagaca 


ggacgcggtt 


tgaaaagtcg 


aattcatggt 


cgtggaagct 


acctccgatc 


atgaacgggc 


ccctgcaccc 


ggactgcaca 


gtggagatgc 


ccatcctgaa 


gcagtccacg 


caggagatcc 


atgagaaggt 


ccacaccatc 


actctcacca 


gggaggacct 


ccggattggc 


agtggttttg 


acaacatcga 


cgtctgcaac 


gtgcccgcgg 


cgtctgtgga 


cctgaacctg 


taccggcggg 


ccacctggct 


ccagagcgtc 


gagcagatcc 


gcgaggtggc 


cttgggcatc 


atcggacttg 


gtcgcgtggg 


cggcttcaac 


gtgctcttct 


acgaccctta 


gctgcagcgt 


gtcagcaccc 


tgcaggacct 


ctgcggcctc 


aacgagcaca 


accaccaccc 


acaaggggcc 


ttcctggtga 


acacagcccg 
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agatgtactg 


tctttcatgt 


ggcctcacat 


4140 


caacttttac 


tatatacctt 


tccttggcca 


4200 


acctttactt 


tttttaagtt 


gaaggtgata 


4260 


actgaagtga 


gtgtaatctc 


agataccaac 


4320 


atatcacctg 


aattctgtca 


tttgtcagat 


4380 


cattaaataa 


aaataaaatt 


gggagcactg 


4440 








4442 


agcggcgcgg 


cggagactcg 


gcacggcggc 


60 


ggacggcgcc 


gaggccgccc 


cacgcgccgg 


120 


cggggcccgc 


ggccgctcgc 


gcctctcgat 


180 


gccgcttgta 


agactttgac 


cccgacgcag 


240 


gagcactttg 


gctcgagaca 


attaaggacg 


300 


ctgcctgggg 


atcctgaaga 


taaaaagctt 


360 


gagcccatat 


taagagatgt 


caggcgtccg 


420 


gcggcccctg 


gtggcattgc 


tggatggccg 


480 


ggacgtggcc 


actgtggcct 


tctgcgacgc 


540 


cctgaacgag 


gctgtggggg 


ccctgacgta 


600 


ggagaagttc 


aaagccctcc 


gcatcatcgt 


660 


catcaagtcg 


gccggggatt 


taggcattgc 


720 


ggagacggcc 


gactcgacgc 


tgtgccacat 


780 


gcaccaggcg 


ctgcgggagg 


gcacacgagt 


840 


gtccggcgct 


gccaggatcc 


gcggggagac 


900 


gcaggcagtg 


gcgctgcggg 


ccaaggcctt 


960 


cttgtcggat 


ggcgtggagc 


gggcgctggg 


1020 


gctcttccac 


agcgactgcg 


tgaccctgca 


1080 


catcaacgac 


ttcaccgtca 


agcagatgag 


1140 


gggtggcctg 


gtggatgaga 


aggcgctggc 


1200 
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v.vj^yy wwsm 


gatgtgcacg 


agtcggaacc 


1260 


f' f*Artf^^ H f" 
C U L. Ca^C C t u 


<*y ^^»yyy 




tacacccaac 

W\4 Wm W W 9 W 


ctcatctgca 


ccccccatgc 


1320 




aguyagcayy 




era ffTpaacTAQ 

y G> lay wy 6»y **y 


aaoQcaacac 

yayy wyy wow 


yyyoy o wwy 


1380 




avvciyy v^yy d 




cctciaaaaac 

WW wy ciGi y ^ ^ w 


ttctgccctg 


tgtccttcgc 


1440 


2 L. U C C U U. U 


a CI y v> ay a ciy CI 


a fit" pa at" acrt 
ay L.^ay i>ci^ u 


w M ^ w w w w w wd 


cgaacgtLtct 


tgtctgtgta 


1500 


^ a^f* ^ ^ ^ H an 

Ca^ u L u u L. 


eiGiWi>cii.Lci^oici 


ayy a u iiv wy u u 


^yw^baywi^y 


tcaacaaaaa 


gaaaaccCga 


1560 


^r^rtsnr^A H f* t* 
ci^^civjCa u u u 


a <Tf f" r*a a ^ 
yyadyuL'Cicii.' 


L i»y ayy u u u u 


tttttttaat 

w w ^ V ^ w ^y y ^ 


tttttttttt 


ttatatattQ 


1620 


gaacgtgccc 


Cay aacy ayy 


Cay u ty y caa 


ci^L. L.wi.>wciyy 


??Pr5a taaa tp 

d W GL C& W y GL Ol W W 


w w V* w w w y www 


1680 




« /^/^a /^a /^a 

y CCaCaCay L 


rtr'a t- h nt" ^ t" 

ycauuyuuuu 


^L>WLiCIWWl>yW 


ttatcttatt 


C^^agaacaa 


1740 




r^aaaa/^aaan 
waacicicoeiciy 


y^uyuuwuuv* 


w^ub^bW ^wyy 


catoaacccc 

w^* w y w w w 


www W w *J W W WW 


1800 




y y i» wci^^j » 




aaaggaaaag 




qcccagcQtg 


1860 


cccgctgccc 


atcgacgtct 


gtcctgggga 


cgtggagggt 


ggcagcgtcc 


ccgcctgcac 


1920 


cagtgccgtc 


ctgctgatgt 


ggtaggctag 


caatattttg 


gttaaaatca 


tgtttgtgac 


1980 


tgtaaccatt 


tgtatgaatt 


attctaaaga 


aataaaaatc 


ctggaaagag 


ccagcgtgcc 


2040 


cagcaaatct 


attctg 










2056 


<210> 70 
<211> 1862 
<212> DNA 
<213> Homo sapien 












<400> 70 
cagtggggga 


gtttgccgcc 


cy^uy^yyyy 


*-y ayy « uyy » 


accacQQQCt 

^ww*«»w333ww 


ctaggcctgc 


60 


cttgcctcgg 


tcctgtgagg 


yy^ L>y\« u k«aa 


PC net" pacTP 

www WW w wdy w 


cc t tctt tgg 


y wuv^^^w w WW 


120 


tgaccaacca 


ggagagagtg 


oyyww wway u 


y uw uy uy &y u 


yy wwwww«*»y 


aaccctiaacc 

yywwwwyyww 


180 


atggagtcat 


ggcatcccgt 


wway a WW w 


aacccctaca 


y waww w wyyy 


accccaggca 


240 


gccctgggtt 


ggatgttcag 


v» wy a &y (ay y 


V4 W^» W W W W W 


acaoQOQcaQ 

awayyyy wc*y 


tatcagcctg 


300 


cagctcctgg 


ggctacagac 


wia* u ^yy w ^•y 


aaaaaoQcca 

y ^uy y y y w wd 


aacaocatat 

yy woy w»by b 


ccccccatca 


360 


gggctcctcc 


cacgagggac 


t uyyaya u wa 


aat'pppt" Too 
aa w w wv» ^ ^yy 


y w www wo wet w 


ccaccac tat 

W W*4 W W^4 W %m \m 


420 


caggagcccc 


tggtgtgtct 


yauyy uww^B>y 


yyy^i-y i-y^-i- 


ct taacrctca 

W W ^^^4 W W^ 


accaacctac 

*J W W***J w W W w 


480 


cgagcagagc 


ggggccgtgg 


ggtctgtggt 


tcccaggcag 


ggccccccac 


399999 t tec 


540 


agtgcccagc 


cacctcccct 


ccgtacccag 


ccacagcact 


caggctgcac 


tgaccatgcc 


600 


tgtgctgttc 


ccagctttag 


ccagggccct 


ctgaaggatg 


cacccaacct 


catctgcacc 


660 


ccccatgctg 


catggtacag 


cgagcaggca 


tccatcgaga 


tgcgagagga 


ggcggcacgg 


720 
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gagatccgca 


gagccatcac 


aggccggatc 


gaccatctga 


cagccgccac 


ccactgggcc 


ctcaatgggg 


ctgcctatag 


gtaccctccg 


ccagctgctg 


tggaaggtat 


cgtccccagc 


gtggcccacc 


cgccccacgc 


cccttctcct 


gaccacgcca 


gtgaccagct 


gtagcccggg 


agggcccgga 


aaccctcgga 


ccagagtgtg 


actgcagaga 


ctggtccggg 


ctgtcaggag 


gcttgtcgtc 


gtccgtcctg 


tgggcgctct 


agaagaagtc 


agtagttatt 


ctcccatgaa 


ttacaaagga 


tctgtttgct 


tagctgtcaa 


gtcaatttga 


ggtttttttt 


tttggttttc 


atgaggcagt 


tggcaaactt 


ctcaggacaa 


cacagcgcat 


tgttttttct 


acctgcttgt 


acaaaggctg 


tttttcctaa 


ttttggcatg 


ccacgaagca 


gctccaaaag 


gaaaagcttg 


acgtctgtcc 


tggggacgtg 


gagggtggca 


tgatgtggta 


ggctagcaat 


attttggtta 


tgaattatct 


taaagaaata 


aaaatcctgg 



tg 

<210> 71 

<211> 1168 

<212> DNA 

<213> Homo sapien 

<400> 71 

ctgcatggta cagcgagcag gcatccatcg 
gccagggccc tctgaaggat gcacccaacc 
tggaccccgc cgtcgtgcac cctgagctca 
tggtgggcgt ggcccccact ggcatcccag 
tgtccctgtc ccacggcctg ccccctgtgg 
aaaccgtcaa gcccgaggcg gatagagacc 
gctctccagc ctcggcgcct gggcagaggg 
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ccagacagcc 


tgaagaactg 


tgtcaacaag 


780 


agcatggacc 


ccgccgtcgt 


gcaccctgag 


840 


ggcgtggtgg 


gcgtggcccc 


cactggcatc 


900 


gccatgtccc 


tgtcccacgg 


cctgccccct 


960 


ggccaaaccg 


tcaagcccga 


ggcggataga 


1020 


aggagctctc 


cagcctcggc 


gcctgggcag 


1080 


tggaggaggc 


atctgtgtgg 


tggccctggc 


1140 


gcgggagggg 


gcagcgctgg 


gcctcgtgtc 


1200 


gccctgtgtc 


cttcgcgttc 


ctcgttaagc 


1260 


cgttcttgtc 


tgtgtacagt 


ttttagaaca 


1320 


caaaaagaaa 


acctgaagga 


gcatttggaa 


1380 


tttttttcgt 


atgttggaac 


gtgccccaga 


1440 


tgaatccttc 


ccgtttttct 


ttttatgcca 


1500 


cttattttta 


gaataattta 


gaaaaacaaa 


1560 


aaccccccct 


Cgttccaaat 


gaagacggca 


1620 


ggcggtgccc 


agcgtgcccg 


ctgcccatcg 


1680 


gcgtccccgc 


ctgcaccagt 


gccgtcctgc 


1740 


aaatcatgtt 


tgtgactgta 


accatttgta 


1800 


aaagagccag 


cgtgcccagc 


aaatctattc 


1860 








1862 


agatgcgaga 


kgaggcaccc 


ttcagcttta 


60 


tcatctgcac 


cccccatgct 


gcatggtaca 


120 


atggggctgc 


ctataggtac 


cctccgggcg 


180 


ctgctgtgga 


aggtatcgtc 


cccagcgcca 


240 


cccacccgcc 


ccacgcccct 


tctcctggcc 


300 


acgccagtga 


ccagttgtag 


cccgggagga 


360 


cccggaaacc 


ctcggaccag 


agtgtgtgga 


420 



wo 2004/050860 PCTAJS2003/040063 



















gcgLggtggc 




^«y c*yau i_yy 


uu.uyyywcyL. 


Cdgydygt-yy 


480 


gagggggcag 


cgctgygcct 




y u^y L. ^y w 


y t. u L y ugy y 


^or*t* r*f^ 
(vyi»uv«i.ywww 




cgcgcccccc 


gcgcLccucg 




yaayLw>ciyuoi 


yL.UdLLU-L^u> 


/^a ^ oa a t" t* 
(..duyddcy L. L. 


600 


ccugcccgcg 


Locagu cccc 




addyyci i» l» uy 


uuwy^-'UUaysa' 


wyv.v*dd^cioid 


660 


aagaaaaccc 


gaaggagcat 


c cggaaguca 


acuuydyyc u 


UL.Lil.UUtirUUy 


yuui.uuuuL.u 


720 


ccccgcacgc 


tggaacgtgc 


cccagaacga 


ggcagc cggc 


dddU.L.i.wu ^d 


nnsk r*a a ^ /na a 
yyd^dd uy dd 


780 


tccttcccgt 


uCCCCtCCCC 


atgccacaca 


gcgcac ugcc 


^ ^ t- ^ f^t^ Tir^e^f' 
C tiCt.dw^U 


y(.UL.yu^L.ud 


840 


Cut ctagaat 


aat ttagaaa 


aacaaaacaa 


aggctgt 1 1 1 


♦"/"r-t-aat-*-*-^ 
u CC Uaau u t. L. 


y y u.d uy ddcc 


7 W U 


cccccc cgc u 


ccaaacgaag 


a ^ a ^ ^ 

dcggca ucac 


ydayc^ay CL.^ 


Caddctyyddd 


dy^ L. uyyy ^y 


960 


gtgcccagcg 


tgcccgctgc 


ccatcgacgt 


ctgtcctggg 


gacgtggagg 


gtggcagcgt 


1020 


ccccgcctgc 


accagtgccg 


tcctgctgat 


gtggtaggct 


agcaatattt 


tggttaaaat 


1080 


catgtttgtg 


actgtaacca 


tttgtatgaa 


ttattttaaa 


gaaataaaaa 


tcctggaaag 


1140 


agccagcgtg 


cccagcaaat 


a ^ f* ^ 

CuauuCCg 








X X D O 


<210> 72 
<211> 1352 
<212> DNA 
<213> Homo sapien 












<400> 72 
gggcatctcg 


tggggtggga 




yy tyyygcdu 


v» wt.yyy ayy i> 


Qercrorr*a t" ft" r* 
y y y y i> ^ w ^ 


60 


tggggcccgg 


ccacttggga 


ggcggggcac 


cccgggggcg 


yyy u.d I.C u\»d 


ydyyy t.yt.u u 




ccggaggctg 


gagtatcttg 


99^99^999^ 


gcaggcggca 


ydydyy c u 


CwdCdy cuyy 


1 fin 


ctggaggcgt 


gatcctgggt 


gtggccctgt 


ggct ccgcca 


uycLCCcy Cdy 


a ^ a a a /^/^ 
d C w d C C dd ^ 


0 ACS 
Z H U 


tcctgtatct 


ggagccggga 


gacaagcccg 


Cy LrCCddCdU. 


CUuCUdWyCd 


yyu.dii,u.ud^d 




tcctcatcgc 


tgtgggcgct 


gccacgacgc 


uCy CuyyCu u 


ccuyyycuy w 


wd^y yyy 


•7 V V 


Cccaggaatc 


ccagtgcctg 




L ucdcc wy 


wc^yy LrCd(>c 


^•Lyuuuy^^u 


rt ^ V 


gtgaggtggc 


cgccggcatc 


cggggcc c^y 


^ ^a a ^ a a mm a 

i^caolCaayya 


^ v»ay d I. Vi.y ^ w 


a a rtof a t* nh oa 
adyyduy uy d 


*t o u 


agcagttcta 


tgaccaggcc 


ccacagcagg 


ccgcggugga 


uydtryaCyCC 


a a r*a a r*t^ct^st 
dds^ddwyt'Wd 




aggctgtggt 


gaagaccttc 


Cocgagacgc 


c ugaccycug 


tggcuccdyc 


a /*a r*^ ns»^^ cm 
dCdCuydwcy 


goo 


ctttgaccac 


ctcagtgctc 


aagaacaatt 


tgtgtccctc 


9g9cagcaac 


atcatcagca 


660 


acctcttcaa 


ggaggactgc 


caccagaaga 


tcgatgacct 


cttctccggg 


aagctgtacc 


720 


tcatcggcat 


tgctgccatc 


gtggtcgctg 


tgatcatgat 


cttcgagatg 


atcctgagca 


780 


tggtgctgtg 


ctgtggcatc 


cggaacagct 


ccgtgtactg 


aggccccgca 


gctctggcca 


840 
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<400> 73 

agatgagcgg ggcagtcaca tcccaccttc cccaagccgg gctgttctgc acagcctgct 



900 
960 



104 

cagggacctc tgcagtgccc cctaagtgac ccggacactt ccgagggggc catcaccgcc 
tgtgtatata acgtttccgg tattactctg ctacacgtag cctttctact tttggggttt 

tgtttttgtt ctgaactttc ctgttacctt ctcagggctg acgtcacatg taggtggcgt 1020 

gtatgagtgg agacgggcct gggtcttggg gactggaggg caggggtcct tctgccctgg 1080 

ggtcccaggg tgctctgcct gcccagccag gcctctcctg ggagccactc gcccagagac 1140 

tcagcttggc caacttgggg ggctgtgtcc acccagcccg cccgtcctgt gggctgcaca 1200 

gctcaccttg ttccctcctg ccccggttcg agagccgagt ctgtgggcac tctctgcctt 1260 

catgcacctg tcctttctaa cacgtcgcct tcaactgtaa tcacaacatc ctgactccgt 1320 

catttaataa agaaggaaca tcaggcatgc ta 1352 



<210> 73 

<211> 1445 

<212> DNA 

<213> Homo sapien 



60 



tgggacgctg gtgggagtca ctgtggcctt cggcactgcc ctggcagtgg gggcagctag 120 

gccatttggg aggggctcgc cttccccagg cccggccctg ggacctcagc cgttgcttag 180 

tggtggcctg cttcagccca ggcatgtggg agaggcacca gacacaggat gtccctctgc 240 

cagcccctga agccccgtcc cctgacgagc tggctggagg cgtgatcctg ggtgtggccc 300 

tgtggctccg ccatgacccg cagaccacca acctcctgta tctggagctg ggagacaagc 360 

ccgcgcccaa caccttctat gtaggcatct acatcctcat cgctgtgggc gctgtcatga 420 

tgttcgttgg cttcctgggc tgctacgggg ccatccagga atcccagtgc ctgctgggga 480 

cgttcttcac ctgcctggtc atcctgtttg cctgtgaggt ggccgccggc atctggggct 540 

ttgtcaacaa ggaccagatc gccaaggatg tgaagcagtt ctatgaccag gccctacagc 600 

aggccgtggt ggatgatgac gccaacaacg ccaaggctgt ggtgaagacc ttccacgaga 660 

cgcttgactg ctgtggctcc agcacactga ccgctttgac cacctcagtg ctcaagaaca 720 

atttgtgtcc ctcgggcagc aacatcatca gcaacctctt caaggaggac tgccaccaga 780 

agatcgatga cctcttctcc gggaagctgt acctcatcgg cattgctgcc atcgtggtcg 840 

ctgtgatcat gatcttcgag atgatcctga gcatggtgct gtgctgtggc atccggaaca 900 

gctccgtgta ctgaggcccc gcagctctgg ccacagggac ctctgcagtg ccccctaagt 960 

gacccggaca cttccgaggg ggccatcacc gcctgtgtat ataacgtttc cggtattact 1020 

ctgctacacg tagccttttt acttttgggg ttttgttttt gttctgaact ttcctgttac 1080 
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ct 1 1. tcaaao 


ctgacgtcac 


atgtaQQtgg 




aqqcaQQQQt 

3 33*-'** 33333 


ccttccgccc 


caggcctctc 


ctgggagcca 


ctcgcccaga 


tccacccagc 


ccgcccgtcc 


tgtgggctgc 


tcgagagccg 


agtctgtggg 


cactctctgc 


ccttcaactg 


taatcacaac 


atcctgactc 


tgcta 






<210> 74 
<211> 2290 
<212> DNA 
<213> Homo sapien 




<400> 74 
gtcaggacca 


aaaaaggcaa 


cgcgcctctc 


tgcaacccag 


gagtgtcagg 


QCCtQaqqqg 

333333 


gtaaggaggt 


ggccacctgc 


agggcattcc 


gctgggtggc 


cgagccccaa 


atctqqqtqt 


ggcaccttct 


cctctgggct 


qgctqqqcaq 

333333 




ttctgcattg 


cctcggtgac 


gggagtgggc 


aggcagtgag 


agacactgac 


ggtggccagg 


cccatgtctc 


qqqcccacaq 


ccctccccag 


gtgcatgctg 


cccagccagg 


acacgccctg 


gtccagctgt 


cccaggcggq 

*'***"**33 ^3 33 


taggaatcca 


acttgatcct 


ccccacacag 


gaagtgctgc 


tgtgcagccc 


aqattqqqqq 

33333^ 


tctcgcacca 


cactggggag 


qcaqcattgg 


aaccccggcc 


tacagtgggg 


cccaccctgt 


agtcccagag 


gctttgggag 


caqqtqaaqq 

'-333^3**^33 


tgggtggtgg 


gtgtggcagg 


tggcgggccc 


gtcgccagca 


ctcagagcgc 


tcatgaggtg 


gtcctgtgtc 


atggaagagg 


taactgaggc 


tgaggtccct 


tgctgctcat 


ccctggcagt 


cccgtggtgg 


gttcggcacc 


caggaccctc 
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105 
cgtgtatgag 


tggagacggg 


cctgggtctt 


1140 


tggggtccca 


gggtgctctg 


cctgctcagc 


1200 


gactcagctt 


ggccaacttg 


gggggctgtg 


1260 


acagctcacc 


ttgttccctc 


ctgccccggt 


1320 


cttcatgcac 


ctgtcctttc 


taacacgtcg 


1380 


cgtcattcaa 


taaagaagga 


acatcaggca 


1440 
1445 


ctgacctgta 


ccccggagtg 


aacccaacct 


60 


agggagacct 


ggctcctggg 


tgccgtgccc 


120 


tggcagaggc 


ttcatctggc 


caggtaggag 


180 


gttctctgcc 


tggcggtggg 


tcctgcccca 


240 


ggacaatggg 


cctggctgcg 


aggagggggc 


300 


gggagatggc 


ccctgcctgc 


tgagggatag 


360 


agctgtcccg 


cgggtacagg 


gccctgtctg 


420 


tgcgcccccc 


acccttggac 


ggcgccttct 


480 


gagcgtgggg 


gagttcggga 


gggctggcct 


540 


gtgctgggct 


tcagccctca 


gcccagggcc 


600 


cagccaggtt 


caaatgcagg 


tcccgtaacg 


660 


gcaggagcca 


gcagggcccc 


cccaccctct 


720 


ttccagttcc 


ggttcctggg 


ctgccctctc 


780 


gccttctgat 


gccactccca 


ccccacgcca 


840 


cggtgggtgg 


cgggtggcag 


gtgcaggcgg 


900 


caccgcaggt 


gtcatccctg 


cgaagcacct 


960 


cccagtcccc 


atgtggcctc 


cttagtctcc 


1020 


acagaaaact 


caccaggcca 


ggctgggatg 


1080 


cagcaaccct 


acatcttccc 


agctgggcgg 


1140 


cggggtcttg 


ggctgtggcg 


agtgtgtagg 


1200 
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cacccacctg 


gtgtctccct 


ccccgcaagg 


J. U D 

catctacatc 


cccaccgctg 


tgggcgctgt 


1260 


catgatgttc 


gttggcttcc 


tgggctgcta 


cggggccatc 


caggaatccc 


agtgcctgct 


1320 


ggggacgttc 


ttcacctgcc 


cggtcatcct 


gtttgcctgt 


gaggtggccg 


ccggcatctg 


1380 


gggctttgtc 


aacaaggacc 


agatcgccaa 


ggatgtgaag cagttctatg 


accaggccct 


1440 


acagcaggcc 


gtggtggatg 


atgacgccaa 


caacgccaag gctgtggtga 


agaccttcca 


1500 


cgagacgctt 


gaccgctgtg 


gctccagcac 


actgactgct 


ttgaccacct 


cagtgctcaa 


1560 


gaacaatttg 


tgtccctcgg 


gcagcaacat 


catcagcaac 


ctcttcaagg 


aggactgcca 


1620 


ccagaagatc 


gatgacctct 


tctccgggaa 


gctgtacctc 


atcggcattg 


ctgccatcgt 


1680 


ggtcgctgtg 


accatgatct 


tcgagatgat 


cctgagcatg 


gtgctgtgct 


gtggcatccg 


1740 


gaacagctcc 


gtgtactgag 


gccccgcagc 


tctggccaca 


gggacctctg 


cagtgccccc 


1800 


taagcgaccc 


ggacacttcc 


gagggggcca 


tcaccgcctg 


ugLauauaaC 


gtttccggta 


1860 


ttactctgct 


acacgtagcc 


tttttactct 


tggggttttg 


ttcttgttct 


gaactttcct 


1920 


gttacctttt 


cagggctgac 


gtcacatgta 


ggcggcgtgt 


atgagtggag 


acgggcctgg 


1980 


gtcttgggga 


ctggagggca 


ggggtccttc 


tgccctgggg 


tcccagggtg 


ctctgcctgc 


2040 


tcagccaggc 


ctctcctggg 


agccactcgc 


ccagagactc 


agcttggcca 


acttgggggg 


2100 


ctgtgtccac 


ccagcccgcc 


cgtcctgtgg 


gctgcacagc 


tcaccttgtt 


ccctcctgcc 


2160 


ccggttcgag 


agccgagtct 


gtgggcactc 


tctgccttca 


tgcacctgtc 


ctttctaaca 


2220 


cgtcgccttc 


aactgtaatc 


acaacaccct 


gactccgtca 


tttaataaag 


aaggaacatc 


2280 


aggcatgcta 












2290 


<210> 75 
<211> 1033 
<212> DNA 
<213> Homo sapien 












<400> 75 
ccattgtgct 


ggaaagagcg 


cgcaacggcg 


gcgacggcgg 


cgaccccacc 


gcgcatcctg 


60 


ccaggcctcc gcgcccagcc 


gcccacgcgc 


ccccgcgccc 


cgcgccccga 


ccctttcttc 


120 


gcgcccccgc 


ccctcggccc 


gccaggcccc 


cttgccggcc 


acccgccagg 


ccccgcgccg 


180 


gcccgcccgc 


cgcccaggac 


cggcccgcgc 


cccgcaggcc 


gcccgccgcc 


cgcgccgcca 


240 


Cgggagtgga gggctgcacc 




agtacctgct 


cttcgtcttc 


aatttcgtct 


300 


tctggctggc 


tggaggcgtg 


atcctgggtg 


tggccctgtg gctccgccat 


gacccgcaga 


360 


ccaccaacct 


cctgtatctg 


gagctgggag 


acaagcccgc 


gcccaacacc 


ttctatgtag 


420 


gcatctacat 


cctcatcgct 


gtgggcgctg 


tcatgatgtt 


cgttggcttc 


ctgggctgct 


480 
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acggggccat 


ccaggaaccc 


cagtgcctgc 


tgtttgcctg 


tgaggtggcc 


gccggcatct 


aggatgtgaa 


gcagttctat 


gaccaggccc 


acaacgccaa 


ggctgtggtg 


aagacctccc 


cactgactgc 


tttgaccacc 


tcagcgctca 


tcatcagcaa 


cctcttcaag 


gaggactgcc 


agctgtacct 


catcggcatc 


gctgccaccg 


tcctgagcat 


ggtgctgtgc 


tgtggcatcc 


ctctggccac 


agggacctct 


gcagtgcccc 


cgaatgttca 


ggc 





<210> 76 

<211> 1190 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (1081) . . (1081) 

<223> n=a, c, g or t 

<220> 

<221> niisc_feature 

<222> (1122) . . (1122) 

<223> n=a, c, g or t 

<400> 76 

ccattgtgct ggaaagagcg cgcaacggcg 
ccaggcctcc gcgcccagcc gcccacgcgc 
gcgcccccgc ccctcggccc gccaggcccc 
gcccgcccgc cgcccaggac cggcccgcgc 
tgggagtgga gggctgcacc aagtgcatca 
tctggctggc tggaggcgtg atcctgggtg 
ccaccaacct cctgtatctg gagctgggag 
gcatctacat cctcatcgct gtgggcgctg 
acggggccat ccaggaatcc cagtgcctgc 
tgtttgcctg tgaggtggcc gccggcatct 
aggatgtgaa gcagttctat gaccaggccc 
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egg y y ciCy I- u 


Ct t t.ciw^ L.y «M 


vrU>yy 


54 0 


ggggct u egc. 


a /** A a nnA 

caaccsciyy ov.* 


VivCiyci ^\«yvvV>o 


600 


Cacagcciggc 


cgtgy t.yyd,t 


y a b y a w VrGi 


660 


acgciy ci<»y v>u 


uyo^uywwy \» 




720 


agaacaacL c 


y uy t.^»*-i- 1 ^y 


y y w \« o Gi ci 


780 


accagaagat 


ego uyciwc U V. 


L.U ^ L. v»^yy ycl 


840 


tggtcgctgt 


gatcatgatc 


ttcgagatga 


900 


ggaacagctc 


cgtgtactga 


ggccccgcag 


960 


ctaagtgacc 


cggacacaat 


gttcaggaca 


1020 








1033 
• 


gcgacggcgg 


(.y CL s. ^ oi ^ ^ 


yv-y^Gi^^*M ^y 


60 


^Cu>(«y ^y 


wy wy ^ wo\»y CI 




120 


^ ^ Ly ^^yy 


o *«- v» y V- oy y 


ccccacacca 


180 


cccyCciyycc 


y ^^^y wwy iiv^ 


w y v^y w \vy v» w a 


240 


ay uaCC tyCu 


uwy 


ao w ^ ^ ^ ^ ^ 


300 


uggcccwy uy 


y ^ u^tfWy w^oi u 


cfa f fOflfaoa 
yoww^y^ayci 


J o w 


acaagcccgc 


y v> w w oi a w oi v« V* 


u uiia> UGi uy uay 


450 


tcatgatgtt 


cgttggcttc 


ctgggctgct 


480 


tggggacgtt 


cttcacctgc 


ctggtcatcc 


540 


ggggctttgt 


caacaaggac 


cagatcgcca 


600 


tacagcaggc 


cgtggtggat 


gatgacgcca 


660 
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acaacgccaa ggctgtggtg aagaccttcc acgagacgct tgactgctgt ggctccagca 720 

cactgactgc tttgaccacc tcagtgctca agaacaattc gtgtccctcg ggcagcaaca 780 

tcatcagcaa cctcttcaag gaggactgcc accagaagat cgatgacctc ttctccggga 84 0 

agctgtacct catcggcatt gctgccatcg tggtcgctgt gatcatgatc ttcgagatga 900 

tcctgagcat ggtgctcaat gataatcttt gtattatagg aaaagtaagg attagtggaa 960 

gacagggttt ctatccaaac caacaacata aacggcaata taattgttaa aattagcctt 1020 

tatatgccat atgttatgcc tggtacatag tagactccca ataaatcttt ggtggcttga 1080 

nttgaagagg taaagttcag atgaaatctt tggcctcctt tnctggccgg ggacagtaag 1140 

attctcatgc taagcaataa aatggtctta tctcttaata atccccatcc 1190 

<210> 77 

<211> 871 

<212> DNA 

<213> Homo sapien 

<400> 77 

ccattgtgct ggaaagagcg cgcaacggcg gcgacggcgg cgaccccacc gcgcatcctg 6 0 

ccaggcctcc gcgcccagcc gcccacgcgc ccccgcgccc cgcgccccga ccctttcttc 120 

gcgcccccgc ccctcggccc gccaggcccc cttgccggcc acccgccagg ccccgcgccg 180 

gcccgcccgc cgcccaggac cggcccgcgc cccgcaggcc gcccgccgcc cgcgccgcca 240 

tgggagtgga gggctgcacc aagtgcatca agtacctgct cttcgtcttc aatttcgtct 300 

tctggctggc tggaggcgcg atcctgggtg tggccctgtg gctccgccat gacccgcaga 360 

ccaccaacct cctgtatctg gagctgggag acaagcccgc gcccaacacc ttctatgtag 420 

gcatctacat cctcatcgct gtgggcgctg tcatgatgtt cgttggcttc ctgggctgct 480 

acggggccat ccaggaatcc cagtgcctgc tggggacgtt cttcacctgc ctggtcatcc 540 

tgtttgcctg tgaggtggcc gccggcatct ggggctttgt caacaaggac cagatcgcca 600 

aggatgtgaa gcagttctat gaccaggccc tacagcaggc cgtggtggat gatgacgcca 660 

acaacgccaa ggctgtggtg aagaccttcc acgagacgct tgactgctgt ggctccagca 720 

cactgactgc tttgaccacc tcagtgctca agaacaattt gtgtccctcg ggcagcaaca 780 

tcatcagcaa cctcttcaag gaggactgcc accagaagat cgatgacctc ttctccggga 84 0 

agctgtacct ggccgcgacc acgctaaggg c 871 

<210> 78 
<211> 1283 
<212> DNA 
<213> Homo sapien 
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<400> 78 



ggaatcacat 


cgagccactg 


aaaattcagt 


ctttcccaaa 


gctcaggatt 


cttcgaaaag 


actttcagat 


tgaagggtac 


aatccgcatc 


gcgctttcaa 


aggagcccga 


aggacattgt 


gtaaaagctc 


tttttgctct 


aaaagaaaaa 


atcagttatt 


aatctttaag 


gatgttgcca 


ataataaaag 


gctttgagtt 


aactcactga 


aagtgaggag 


aatgaaatgt 


atgtgctctt 


acgtacttat 


aaagaaggtt 


ggtgaatctc 


ttttaagaat 


ttcacaagct 


attccctcaa 


atgatgtaga 


gtgtggttat 


gaactttaaa 


aagaagtgtt 


ctgcattcgt 


ccacgctttg 


ttccttccta 


aaatagatta 


aagaactctc 


ggatgctact 


taaaagagta 


tattttagaa 


ctcattttaa 


ctgcatctta 


tcctcaaaat 


tttttttttt 


taaactttta 


taaccttaaa 


attttgccct 


cattagcttc 


agcatggtgt 


aaggaaaaga 


tgcaaaacca 


cttcggggtt 


taccagatac 


ccccggtgtt 


gcacgactat 


aacagaacaa 


ggaattaccc 


caacaagtta 


tttaatgtct 


aggtgccagc 


ccttgatata 


ttgggttagt 


taaatcgtcg 


act 



<210> 79 

<211> 1169 

<212> DNA 

<213> Homo sapien 

<400> 79 

ttcatttctt aattggcatt tagtttagca 
cttcagcagt tctagtgttt agatgtggga 
atctctaaaa gtggcttgtc cctgcaggat 
cacatttcat tgagttgata ccagcagatt 
actacagcac ctcaccacat tctccctctc 
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ggctcgatgt 


gcttcagcga 


gaacccagac 


60 


ttgagaaaat 


tgatgacttc 


aaagccgaag 


120 


caactattaa 


aatggaaatg 


gctgtttagg 


180 


cagtctttag 


gggttgggct 


ggatgccgag 


240 


ggaactaggt 


caaaaatctg 


tccgtgacct 


300 


ctggcaaatg 


taactgtgcc 


agttctttcc 


360 


gggtatctga 


caatgctgag 


gttatgaaca 


420 


agcaaaaaca 


tgtatgtgca 


tttcaatccc 


480 


acaagctatt 


tttggaatat 


ttccagaata 


540 


atctgaggga 


gctgagtaac 


accatcgatc 


600 


gttatagttg 


ttttatatgt 


tgctataata 


660 


ttcattctgt 


actgccactt 


atctgctcag 


720 


cttaagtaaa 


catgtgctgt 


attctggttt 


780 


ataatagcga 


atatattttg 


ccctattttt 


840 


ataatgacca 


tttaggatag 


agtttttttt 


900 


gggttatttt 


aaaataatct 


atggactacc 


960 


gacttctcta 


ataatatgct 


tagattaagc 


1020 


aatcagtgaa 


atatttttcc 


cttcgttgca 


1080 


ttttattctg 


ctaatttatg 


acaagtgtta 


1140 


tgcaacatgt 


tgcttatttt 


caaattacag 


1200 


gctatttttg 


taagaacatc 


ctcctggact 


1260 








1283 


tagtaatctt 


ttttttttaa 


tttgctgcca 


60 


gaggttaagt 


aagatctgta 


ttgccctatt 


120 


agtgaatgct 


tgtctgtacc 


attggagtta 


180 


tttttcaggg 


aaaaataatc 


caatctcata 


240 


ctaacagtgt 


agagacttag 


atttaattgc 


300 
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^ 3 ^ ^ ^ y™f ^ ti* s 

ucaacagcud 


tttctggtgg gatgttgaag 


aaagctagtg 


ggcaaaggat 






cogcicaccgg 


tgatgggctc 


ccaggcttgg 


cggtggcttg 


ctttctgcac 


ATA 






gcatctttgg 


ctctgggatt 


aataattcat 


ctgccttctc 


A Q l\ 






cctttctctt 


accccttccc 


ggtagtgtga 


agtgaggggg 


DH U 


ccucuccccc 


tcct t CCCCt 


tcctctgtga 


tccaccttcc 


tttttacccC 


gccccgcggc 


600 


ggctccgccc 


ct cacct tea 


tggacgactc 


agaggtggag 


tcgaccgcca 


gcatcttggc 






gaacaagagg 


cccagtttga 


gaagctgacc 


cgggcgctgg 


aggaggaacg 


"7 o n 


^^^^^ ^ ^ A ^ ^ 


tcggcgcagc 


tggaacgcgt 


ccgggtctca 


ccacaagatg 


ccaacccact 


nan 
780 


caccfsccaac 


ggcacacccc 


cctttcgaaa 


gaaatgcaaa 


aaaaagagta 


tat cttcaag 


o4 0 




cCat c taagg 


aaagcaaaat 


catttcccaa 


a ugcataLca 


t tz tgtgagaa 




tctctcatta 


atatcctgaa 


tcattcattt 


cagctaaggc 


ttcatgttga 


ctcgatatgt 


960 


catctaggaa 


agtactattt 


catggtccaa 


acctgttgcc 


acagttggta 


aggctttcct 


1020 


ttaagtgtga 


aatatttaca 


tgaaattttc 


tcttttaaag 


ttctttatag 


ggttagggtg 


1080 


tgggaaaacg 


ctatattaat 


aaatctgtag 


tgtttagtgt 


c tatacgccc 


agaaccagag 


114 0 


tagactggat 


tgaaagatgg 


actgggtct 








1169 


<210> 80 

<211> 406 

<212> DNA 

<213> Homo sapien 












<400> 80 
gatgatcata 


tattagcacg 


gttcataatc 


atgtcgacgc 


gcagtgtatg 


atattgcaga 


o 0 


atcggcttac gtgtcgcgcg 


agtcctccgg ggtatggtca 


tgtacaccag 


caaggaccgc 


1 "3 

1^ u 


tatttctatt 


ttgggaagct 


tgatggccag atctcctctg cctaccccag ccaagagggg 


180 


caggtgctgg 


tgggcatcta 


tggccagtat 


caactccttg gcatcaagag 


cattggcttt 


240 


gaacggaatt 


atccactaga 


ggagccgacc actgagccac cagttaatct 


cacatactca 


300 


gcaaactcac 


ccgtgggtcg 


ctagggtggg 


gtacggggcc 


atccgagctg 


aggccatctg 


360 


ggtggtggtg gccgatggta 


cctgcccggc 


ggcgctcgaa 


acgccg 




406 


<210> 81 
<211> 1902 
<212> DNA 
<213> Homo sapien 












<400> 81 
tataggcgcc 


tgggtttcta 


atctgtcgag 


cggcgcagtg 


tgatggatga gcggccgccc 


60 
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aaacActct^aa 


W^OL>3 W W W W 


^N^^aw^waww 


yy wway ay y w 


catgctgctg 


ctgctcacgc 


120 




yyyyyy*-^^^ 


acctaaocaa 


aaa a aa t* a t* a 
^Maaya ^y wn 


taoccctooa 

www^wj^a 


aaaaacaaat 
yy «yy *-»«y 


180 


a t" t" ^ Cfiaoit^ 


eaetaaaoac 

wawwycioi^wiw 


tacaaccata 


aaa teaeaaa 

aoa uwawayy 


ac tacaaat a 


tctataaatc 

w>rW^ *'^«J3 WW 


240 


ttct CPtaat 

U h.V« ip ^. W UVj^ u 


aaa a aot or tc 

yaaaayw^ ww 


caaataaaac 


ttaaaaaetc 

w^*^ya^awi>w 


ctaaaacata 

w wv^yyawsj 1.%^ 


aaactaaaaa 


300 


v»wu ^^yy ^yy 


oa atarT'Paa 

yao^awwwca^>j 


aaaat eacee 

>|4 w WWA W WW 


t ar'aaeeaaa 

VvVjwaywwa^lj 


cgaatacatc 


acaaaaatct 


360 


u uy L.^yu>u.L.L. 


f^r'aaof^ f t" r* 
wwciciywu uuw 


nt" c caoaat a 

WL.WWyyyywCl 


t" aa t" a t" a t a 
uyy L.WCI L.yua 


caccaacaaa 

W W W w w ^ ^ 


aaccactat t 

y Cft W W *5 W ki» W W W 


420 




oaaar't* t" era t 
vjaayw^ w^aw 


aaccaaat ct 

W W a^ W& W W 


eet"etaeeta 

wwwwwywwwa 


ccccaaccaa 


aaaaaacaaa 

yayyyy woyy 


480 


act' aaY aaa 


eat etataoc 

waiipWwai,.^^w 


eaatateaae 

WGi^ wa ^ WuuW 


tcct taocat 

w w w w w wa w 


caaaaacat t 


aactt taaat 


540 


aaaat tatcc 

^ 0 W W M W W W 


actaaaaaao 


ccoaccacta 


aaccaccaat. 


caa^ctcaca 


tacCcagcaa 


600 




aaateaetacr 
yyy uuyuuay 


aataaaat a t 

yy wyyyy waL. 


^ W WGi w W W 


aaactaaaac 


catctaaata 

wak. wi.^y^ wy 


660 


ytgytygcty 


a t act t" a e t aa 
ouyy WGiw^yy 


aat" aaetcraa 

ay waaw ^y ^y 


t eaaaa eaet 

w wy yya wy w V 


aaa tetaaa t 

yaawww^aaw 


ccaccaataa 

w^^a w waa w u a 


720 


cii,.oioiGiyi..uuv> 


t oeaoaa t ea 


at" aea t eeao 
yuywabwway 


aa 1 1" ao t e e t 

ya^ ^yy 


t aaa t e t aaa 

wyya i»wwyyy 


at a eaaeeaa 

ywawaawwaa 


780 


acrcct tenet 


act ect taaa 

^ W W W W w %■» y 


gacaaagtcc 


ccaatac tac 

w wo^ w y ^ 


agcccagtiga 


act gaga tga 


840 


yyyy t-ayyy^ 




wc.^wwyc»^>lljti 


wd ^ o d a ^ a ^ CL 


^ wo W Wd w 


cccc tcaaaa 

w w ^ w w wo^ 


900 


y uy^uy uyya 


tc t c tahacc 

w W w W ^ ^ w WW 


agt cccacaa 


tct t tgaaga 


gCcaggct tc 


siaggccgcca 


960 




tccctcaccc 

Km WWW W W0 W W W 


caciacccacc 

wa^ ^ w w w^ w w 


caacaaacct 

wwywy*j*j^w w 


ctoctaacca 


ctcaatccca 


1020 


croattct eta 


aataeteaaa 

w^ w w waci^ 


oooce t caaci 
y y y ^"^y y 


aaaaccactc 

w wa w ^ w 


actcatccat 

^&w w w^ w www w 


tcactcaaca 


1080 


a a ea 1 1 f" qcc* 


aaaaeecit at 
ciciyy wwy ^y ^ 


eet ea eeaaa 

wwwwawwavjy 


wtw^ww '^yy^ 


actaaaaata 




1140 


y ^^oy y c c 


oy w i»y ay i.y w 


at a t a eaet a 

ywa wawawwy 


uyy uyyyyyt- 


taatacaaaa 




1200 


creaat t* f*aot 

y way c> l> w 


t taceateta 

V. i»y w wu w w w*|^ 


ea taaa taaa 


aaeaeataea 

^a w w a w \ nfiii 


aat cattata 

y y u wa w wy wy 


cgtaggcacc 


1260 


i.y WW way ay y 


t aaaat aaaa 
t y »yy wyaya 


aaaa t atac t 
yyy ** i^y v-y w l. 


teaaaaaaat 

i» wayy y aay w 


Ct tcct aaaa 

www w w w a^ 


aaaacaa tat 


1320 


efoaact" a aa 

w ^ ^ ^ z3 a 


^ W W W OtuCkv^^ u 


tgaaggacaa 


tt aaccaaat 


ggagacaaag 


aaaaaaaaaa 
yy "yy^yyyy 


1380 


at tacaaat t 

a V oi WG(^^ w w 


gagggc^caa 


ca taaacaaa 


gacaggagat 


acaaataact 


ggggtaat tt 


1440 


CT^oaeaotafl 
y u wciwoy uay 


eaa eaaaaaa 
waci^ciyyaaa 


heaatar*aaa 
uway ws^^aay 


eeet aaaeet 
wv»VwwyyawVi»i» 


aaeeea 1 1 et 

w^wwwaiirWWL* 


tea tat eeee 

wwawywwwww 


1500 


u Uw wwdciy WW 


t wwcicly wwwa 


wci uy yy waw l. 


uy wwociy a u w 


ay ay w ^ wway 


yyy wwu wway 




yy**-3ayy*-*- 


t teaateet t 

w ^ way w w w ^ b 


1" aaaa eea t o 

^y y y a wwa ^y 


ateeaeaaaa 

awwwawav4a«^ 


aaaaat taat 


ect act taaa 
ww^awb y y 


1620 


acacaggaag 


caacacatgt 


aatgcaacag 


ccaagcaccc 


aacgatcatc 


taatgcaaca 


1680 


gccaaacacc 


cagcgatcat 


ctaatgcaac 


agccaaacac 


ccagcgatca 


tctaatgcaa 


1740 


cagccaaaca 


cccagcgatc 


atctaatgca 


acagccaaac 


acccagcgat 


catctaatgc 


1800 


aacagccaaa 


cacccagtga 


tcatctaatg 


caacagccaa 


acacccagtg 


atcatctaat 


1860 


gcaacagcca 


aacagacaag 


tgatcatcta 


atgcaacagc 


ca 




1902 
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<210> 82 

<211> 1911 

<212> DNA 

<213> Homo sapien 

<400> 82 



catgccgagc ggcgcagtgt gatggatcgg ccgccgggca ggtggagggt gcccggcaca 


60 


accagacgcc 


cagccacggc gagagccctg ggatgcaccg gccagaggcc atgctgctgc 


120 


tgctcacgct 


tgccctcctg gggggcccca 


cctgggcagg gaagatgtat ggccctggag 


180 


gaggcaagta 


tttcagcacc 


actgaagact 


acgaccatga 


aatcacaggg ctgcgggtgt 


240 


ctgtaggtct 


tctcctggtg 


aaaagtgtcc 


aggtgaaact 


tggagactcc tgggacgtga 


300 


aactgggagc 


cttaggtggg 


aatacccagg 


aagtcaccct 


gcagccaggc gaatacatca 


360 


caaaagtctt 


tgtcgccttc 


caagctttcc 


tccggggtat 


ggtcatgtac accagcaagg 


420 


accgctattt ctattttggg aagcttgatg gccagatctc ctctgcctac cccagccaag 


480 


aggggcaggt 


gctggtgggc 


atctatggcc 


agtatcaact 


ccttggcatc aagagcattg 


540 


gctttgaatg 


gaattatcca 


ctagaggagc 


cgaccactga 


gccaccagtt aatctcacat 


600 


actcagcaaa 


ctcacccgtg 


ggtcgctagg 


gtggggtatg 


gggccatccg agctgaggcc 


660 


atctgggtgg 


tggcggctga 


tggtactgga 


gtaactgagt 


cgggacgctg aatctgaatc 


720 


caccaataaa 


taaagcttct 


gcagaatcag 


tgcatccagg 


attggtcctt ggatctgggg 


780 


tacaaccaaa 


gccttccctg 


ctccttggag 


acaaagtccc 


cagtgctgca gcccagtgaa 


840 


ctgagatgag 


gggtagggca 


aaggtgactc 


tgccgaggac 


agaaagagag cagcaccacc 


900 


ccctcagagg 


tgctgtggat 


ctctgtgcca 


gtcccacaat 


ctttgaagag tcaggcttca 


960 


aggccgccac 


tccccactgt 


ccctcacccc 


aggcccaccc 


agcgsacctc tgctggccac 


1020 


tcagtcccgg gattctctga 


gtgcccaagg 


ggcctcaggg 


aagccactca ctcatccatt 


1080 


cactcagcaa 


acatttgcca 


aggccgtgtc 


ctcaccaggc 


tcgcctggca ctgggggtat 


1140 


aaagaagagg 


ccaggtttca 


gctgagtgcg 


catacactgt 


ggtgggggct ggtgcagagg 


1200 


cagatagtgg 


cagttcactt 


tgccatctgc 


atggatgggg 


acacatgcag gtcattgtgc 


1260 


gtaggcacct 


gcccagaggt 


gaggtgagag 


ggatgtgctt 


cagggaagtc ttcctagagg 


1320 


aggcaatgtc 


tgagctaagt 


cctaaaggat 


gaaggacaat 


tggccaagtg gagacaaagg 


1380 


gaggagggga 


ttacaggttg agggctcaac 


atgagcaaag 


acaggagatg cagatggctg 


1440 


gggtaatttg 


tcacagtagc 


aacaggaaat 


cagtacaagc 


cctggacctg gcccattctt 


1500 


catgtcccct 


tcccaagcct 


ccaagcccac 


acgggcactt 


gccaagatca gagccccagg 


1560 


ggcctccagg 


gacggggttt 


tcagtccttt 


gggaccatga 


tccacagaga gaaattgatc 


1620 
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cacaQQaacrc 


aacacatgta 


acgcaacagc 


caagcaccca 


acgatcatct 


1680 


aatgcaacag 


ccaaacaccc 


agcgatcatc 


taatgcaaca 


gccaaacacc 


cagcgatcat 


1740 


ctaatgcaac 


agccaaacac 


ccagcgatca 


tctaacgcaa 


cagccaaaca 


cccagcgatc 


1800 


atctaatgca 


acagccaaac 


acccagtgac 


catctaatgc 


aacagccaaa 


cacccagtga 


1860 


tcatctaatg 


caacagccaa 


acagacaagt 


gatcatctaa 


tgcaacagcc 


a 


1911 


<210> 83 
<211> 1852 
<212> DNA 
<213> Homo sapien 












<400> 83 
atgcatgtcg 


agcggcgcag 


cgtgatggat 


cgtggtcgcg 


gcgaggtggc 


catgctgctg 


60 


ctgctcacgc 


ttgccctcct 


ggggggcccc 


acctgggcag 


ggaagatgta 


tggccctgga 


120 


ggaggcaagt 


atttcagcac 


cactgaagac 


tacgaccatg 


aaatcacagg 


gctgcgggtg 


180 


tccgtaggtc 


ttctcctggt 


gaaaagtgcc 


caggtgaaac 


ttggagactc 


ctgggacgtg 


240 


aaactgggag 


ccttaggtgg 


gaatacccag 


gaagtcaccc 


tgcagccagg 


cgaatacatc 


300 


acaaaagtct 


ttgtcgccct 


ccaagctttc 


ctccggggta 


tggtcatgta 


caccagcaag 


360 


gaccgccatt 


tctattctgg 


gaagcttgat 


ggccagatct 


cctctgccta 


ccccagccaa 


420 


gaggggcagg 


tgctggtggg 


catctacggc 


cagtatcaac 


tccttggcat 


caagagcatt 


480 


ggctttgaat 


ggaattatcc 


actagaggag 


ccgaccactg 


agccaccagt 


taatctcaca 


540 


tactcagcaa 


actcacccgt 


gggtcgctag 


ggtggggtat 


ggggccatcc 


gagctgaggc 


600 


catctgggtg 


gtggtggctg 


atggtactgg 


agtaaccgag 


tcgggacgct 


gaatctgaat 


660 


ccaccaataa 


ataaagcttc 


tgcagaatca 


gtgcatccag 


gattggtcct 


tggatctggg 


720 


gtacaaccaa 


agccttccct 


gctccttgga 


gacaaagtcc 


ccagtgctgc 


agcccagtga 


780 


actgagatga 


ggggtagggc 


aaaggtgact 


ctgccgagga 


cagaaagaga 


gcagcaccac 


840 


cccctcagag 


gtgctgtgga 


tctctgtgcc 


agtcccacaa 


tctttgaaga 


gtcaggcttc 


900 


aaggccgcca 


ctccccactg 


tccctcaccc 


caggcccacc 


cagcgggccc 


ctgctggcca 


960 


ctcagccccg 


ggattctctg 


agtgctcaag 


gggcctcagg 


gaagccactc 


actcatccat 


1020 


tcacccagca 


aacacttgcc 


aaggccgtgt 


cctcaccagg 


ctcgcctggc 


actgggggta 


1080 


taaagaagag 


gccaggtttc 


agctgagtgc 


gtatacactg 


tggtgggggc 


tggtgcagag 


1140 


gcagatagtg 


gcagttcact 


ttgccatctg 


catggatggg 


gacacatgca 


ggtcattgtg 


1200 


cgtaggcacc 


tgcccagagg 


tgaggtgaga 


gggatgtgct 


tcagggaagt 


cttcctagag 


1260 


gaggcaatgt 


ctgagctaag 


tcttaaagga 


tgaaggacaa 


ttggccaagt 


ggagacaaag 


1320 
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ggaggagggg 


attacaggtt 


gagggctcaa 


catgagcaaa 


gacaggagat 


gcagatggcc 


1380 


ggggtaattt 


gtcacagtag 


caacaggaaa 


tcagtacaag 


ccctggaccc 


ggcccattct 


1440 


ccatgtcccc 


cccccaagcc 


tccaagccca 


catgggcact 


tgccaagatc 


agagctccag 


1500 


gggcctccag 


ggacggggct 


ttcagtcctt 


tgggaccatg 


atccacagag 


agaaattgat 


1560 


cctacttgag 


acacaggaag 


caacacatgt 


aacgcaacag 


ccaagcaccc 


aacgatcatc 


1620 


taatgcaaca 


gccaaacacc 


cagcgatcat 


ctaatgcaac 


agccaaacac 


ccagcgatca 


1680 


tctaatgcaa 


cagccaaaca 


cccagcgatc 


atctaatgca 


acagccaaac 


acccagcgat 


1740 


catctaatgc 


aacagccaaa 


cacccagtga 


tcatctaatg 


caacagccaa 


acacccagtg 


1800 


atcatctaat 


gcaacagcca 








ca 


1852 


<210> 84 
<211> 1798 
<212> DNA 
<213> Homo sapien 












<400> 84 
catgctcgag 


cggcgcaktg 


tgatggatcg 


cccgggcagg 


tccagtcaca 


gatgtatggc 


60 


cctggaggag 


gcaagtattt 


cagcaccact 


gaagactacg 


accatgaaat 


cacagggctg 


120 


cgggtgtctg 


taggtcttct 


cctggtgaaa 


agtgtccagg 


tgaaacctgg 


agactcctgg 


180 


gacgtgaaac 


tgggagcctt 


aggtgggaat 


acccaggaag 


tcaccctgca 


gccaggcgaa 


240 


tacatcacaa 


aagtctttgt 


cgccttccaa 


gctttcctcc 


ggggtatggt 


catgtacacc 


300 


agcaaggacc 


gctatttcta 


ttttgggaag 


cttgatggcc 


agatctcctc 


tgcctacccc 


360 


agccaagagg 


ggcaggtgct 


ggtgggcatc 


tatggccagt 


atcaactcct 


tggcatcaag 


420 


agcattggct 


ttgaatggaa 


ttatccacta 


gaggagccga 


ccactgagcc 


accagttaat 


480 


ctcacatact 


cagcaaactc 


acccgtgggt 


cgctagggtg 


gggtatgggg 


ccatccgagc 


540 


tgaggccatc 


tgggtggtgg 


tggctgatgg 


tactggagta 


actgagtcgg 


gacgctgaat 


600 


ctgaatccac 


caataaataa 


agcttctgca 


gaatcagtgc 


atccaggatt 


ggtccttgga 


660 


tctggggtac 


aaccaaagcc 


ttccctgctc 


cttggagaca 


aagtccccag 


tgctgcagcc 


720 


cagtgaactg 


agatgagggg 


tagggcaaag 


gtgactctgc 


cgaggacaga 


aagagagcag 


780 


caccaccccc 


tcagaggtgc 


tgtggatctc 


tgtgccagtc 


ccacaatctt 


tgaagagtca 


840 


ggcttcaagg 


ccgccactcc 


ccactgtccc 


tcaccccagg 


cccacccagc 


gggcctctgc 


900 


tggccactca 


gtcccgggat 


tctctgagtg 


ctcaaggggc 


ctcagggaag 


ccactcactc 


960 


atccattcac 


tcagcaaaca 


tttgccaagg 


ccgtgtcccc 


accaggctcg 


cctggcactg 


1020 


ggggtataaa 


gaagaggcca 


ggtttcagct 


gagtgcgtat 


acactgtggt 


gggggctggt 


1080 
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gcagaggcag 


atagtggcag 


ttcactttgc 


catctgcatg 


gatggggaca 


catgcaggtc 


1140 


actgtgcgta 


ggcacccgcc 


cagaggtgag 


gtgagaggga 


tgtgcttcag 


ggaagtcttc 


1200 


ctagaggagg 


caatgtctga 


gctaagtctt 


aaaggatgaa 


ggacaattgg 


ccaagtggag 


1260 


acaaagggag 


gaggggatta 


caggttgagg 


gctcaacatg 


agcaaagaca 


ggagatgcag 


1320 


atggctgggg 


taattcgtca 


cagtagcaac 


aggaaatcag 


tacaagccct 


ggacctggcc 


1380 


cattcctcat 


gtccccttcc 


caagccccca 


agcccacatg 


ggcacttgcc 


aagatcagag 


1440 


ccccaggggc 


ctccagggac 


ggggttttca 


gtcctttggg 


accatgatcc 


acagagagaa 


1500 


attgatccta 


cttgagacac 


aggaagcaac 


acatgtaatg 


caacagccaa 


gcacccaacg 


1560 


atcatctaat 


gcaacagcca 


aacacccagc 


gatcatctaa 


tgcaacagcc 


aaacacccag 


1620 


cgatcatcta 


atgcaacagc 


caaacaccca 


gcgatcacct 


aatgcaacag 


ccaaacaccc 


1680 


agcgatcatc 


taatgcaaca 


gccaaacacc 


cagtgatcat 


ctaatgcaac 


agccaaacac 


1740 


ccagtgatca 


tctaatgcaa 


cagccaaaca 


gacaagtgat 


catctaatgc 


aacagcca 


1798 


<210> 85 
<211> 3099 
<212> DNA 
<213> Homo sapien 












<400> 85 
ggcgaatggg 


cctctagatg 


catgctcgag 


cggcgcagtg 


tgatggatcg 


ccgcggcagg 


60 


gtcacaggcg 


agagccctgg 


gatgcaccgg 


ccagaggcca 


tgctgctgct 


gctcacgctt 


120 


gccctcctgg 


ggggccccac 


ctgggcaggg 


agtaagtcag 


tggggtctgc 


cctcaatctc 


180 


ccctgcctcc 


ctccaggaga 


gccagggact 


cacccggccc 


ttgtcccaga 


ctaactctgg 


240 


tcacagaacc 


atcctgtctg 


cctggagggg 


cggggtcccc 


tgttccggca 


gaggtcaccc 


300 


ccatatcacc 


gcatggggat 


tttcttccct 


ttgggtctct 


cttttcttca 


gagatgtatg 


360 


gccctggagg 


aggcaagtat 


ttcagcacca 


ctgaagacta 


cgaccatgaa 


atcacagggc 


420 


tgcgggtgtc 


tgtaggtctt 


ctcctggtga 


aaaggtgagt 


agggctatgg 


tcatgggccc 


480 


agcgccatgt 


cccctcccat 


cccacagttt 


caggaactca 


gggcagcggg 


taagcacccg 


540 


tggccacttt 


cgccacacat 


gcctggctac 


tgtcgatgct 


tcctggctcc 


cgctgatgct 


600 


tcctggctgg 


agcggacacg 


gtcagaccgt 


cctccctacc 


ttctcccttc 


aacccaagct 


660 


caactcaacc 


aaaaatggcc 


cctctgtccc 


catgcctgac 


aggaaagtca 


ggggaaagtc 


720 


tgtccgatta 


ctgtcaaaga 


agacaggagg 


taagggtcag 


agtggaccac 


tgactgaata 


780 


tgagtcgcag 


aagtgttaga 


ggcagaagtc 


cagggccatt 


tccttaacat 


cgaagtgtct 


840 


ctgctggagg 


tctgggatgg 


atttttgccc 


tgcatttaga 


agttctgggg 


tcctgggaga 


900 
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ggggagagaa 


gcccaatagc 


agaggagaca 


atcctgggag 


agcaccaggg 


tgagggaggg 


gcgagtgtgg 


gaagtcacct 


gcccctcggc 


gctcgggagg 


tccaggctcg 


gccagaggca 


ggtgccttct 


gggtcacgga 


gacctggcgc 


tgactcctcc 


caaaagtcac 


acccactttg 


ggtaacgagc 


tcaagatcac 


agggcccaaa 


gtctgtgcca 


gagacccaaa 


gtctgatggt 


tgggggcgct 


gaggacccgg 


gatgtgctgg 


gcctctctcc 


cagtgtccag 


gtgaaacttg 


taggtgggaa 


tacccaggaa 


gtcaccctgc 


tcgccttcca 


agctttcctc 


cggggtatgg 


attttgggaa 


gcttgatggc 


cagatctcct 


tggtgggcat 


ctatggccag 


tatcaactcc 


attatccact 


agaggagccg 


accactgagc 


cacccgtggg 


tcgctagggt 


ggggtatggg 


gtggctgatg 


gtactggagt 


aactgagtcg 


aagcttctgc 


agaatcagtg 


cacccaggat 


cttccctgct 


ccttggagac 


aaagtcccca 


gtagggcaaa 


ggtgactctg 


ccgaggacag 


ctgtggatct 


ctgtgccagt 


cccacaatct 


cccactgtcc 


ctcaccccag 


gcccacccag 


ttctctgagt 


gctcaagggg 


cctcagggaa 


atttgccaag 


gccgtgtcct 


caccaggctc 


aggtctcagc 


tgagtgcgta 


tacactgtgg 


gttcactttg 


ccatctgcat 


ggatggggac 


ccagaggtga 


ggtgagaggg 


atgtgcttca 


agctaagtct 


taaaggatga 


aggacaattg 


acaggttgag 


ggctcaacat 


gagcaaagac 


acagtagcaa 


caggaaatca 


gtacaagccc 
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gagtgtgggc 


ggggcgagcc 


ggaggggcgc 


960 


gtgaagatga 


gccccgtcag 


ggaagcgctg 


1020 


ctgtgagctg 


ctctgcttgg 


agtgactaag 


1080 


gctcatatgt 


gggccacagt 


gacggcagcc 


1140 


tgcacgcagc 


tctcctcacc 


aggatctcag 


1200 


cagacgggga 


aactgagtcc 


ggagaggctg 


1260 


agtggtagaa 


tcagggttgg 


tgaccagtga 


1320 


gctggactct 


ctgcatcccg 


ggaaggagga 


1380 


gccaccccag 


atctggacgt 


ccaaagcttt 


1440 


gagactcctg 


ggacgtgaaa 


ctgggagcct 


1500 


agccaggcga 


atacatcaca 


aaagtctttg 


1560 


tcatgtacac 


cagcaaggac 


cgctatttct 


1620 


ctgcctaccc 


cagccaagag 


gggcaggtgc 


1680 


ttggcatcaa 


gagcattggc 


tttgaatgga 


1740 


caccagttaa 


tctcacatac 


tcagcaaact 


1800 


gccatccgag 


ctgaggccat 


ctgggtggtg 


1860 


ggacgctgaa 


tctgaatcca 


ccaataaata 


1920 


tggtccttgg 


atctggggta 


caaccaaagc 


1980 


gtgctgcagc 


ccagtgaact 


gagatgaggg 


2040 


aaagagagca 


gcaccacccc 


ctcagaggtg 


2100 


ttgaagagtc 


aggcttcaag gccgccactc 


2160 


cgggcctctg 


ctggccactc 


agtcccggga 


2220 


gccactcact 


catccattca 


ctcagcaaac 


2280 


gcctggcacc 


gggggtataa 


agaagaggcc 


2340 


tgggggctgg 


tgcagaggca 


gatagtggca 


2400 


acatgcaggt 


cattgtgcgt 


aggcacctgc 


2460 


gggaagtctt 


cctagaggag gcaatgtctg 


2520 


gccaagtgga 


gacaaaggga 


ggaggggatt 


2580 


aggagatgca 


gatggctggg gtaatttgtc 


2640 


tggacctggc 


ccattcttca 


tgtccccttc 


2700 
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ccaagcctcc aagcccacat 


gggcacttgc 


117 

caagatcaga gctccagggg cctccaggga 


T T C n 


cggggttttc 


agtcctttgg 


gaccatgatc 


cacagagaga aattgatcct 


acttgagaca 




caggaagcaa 


cacatgtaat 


gcaacagcca 


agcacccaac gatcatctaa 


tgcaacagcc 




aaacacccag 


cgatcatcta 


atgcaacagc 


caaacaccca gcgatcatct 


aatgcaacag 


2940 


ccaaacaccc 


agcgatcatc 


taatgcaaca 


gccaaacacc cagcgatcat 


ctaatgcaac 


3000 


agccaaacac 


ccagtgatca 


tctaatgcaa 


cagccaaaca cccagtgatc 


atctaatgca 


3060 


acagccaaac 


agacaagtga 


tcatctaatg 


caacagcca 




3099 


<210> 86 
<211> 1547 
<212> DNA 
<213> Homo sapien 










<400> 86 
cggcttagcg 


tggtcgcggc 


gagtcctccg 


gggtatggtc atgtacacca 


gcaaggaccg 


D U 


ctatttctat 


tttgggaagc 


ttgatggcca 


gatctcctct gcctacccca 


gccaagaggg 




gcaggtgctg 


gtgggcatct 


atggccagta 


tcaactcctt ggcatcaaga 


gcattggctt 




tgaatggaat 


tatccactag 


aggagccgac 


cactgagcca ccagttaatc 


ccacatactc 


o A n 


agcaaactca 


cccgtgggtc 


gctagggtgg 


ggtatggggc catccgagct 


gaggccatct 


^ uu 


gggtggtggt 


ggctgatggt 


actggagtaa 


ctgagtcggg acgctgaatc 


tgaatccacc 


1 C A 
J D u 


aataaataaa 


gcttctgcag 


aatcagtgca 


tccaggattg gtccctggat 


ctggggtaca 




accaaagcct 


tccctgctcc 


ttggagacaa 


agtccccagt gctgcagccc 


agtgaactga 


A Q f\ 

4 oU 


gatgaggggt 


agggcaaagg 


tgactccgcc 


gaggacagaa agagagcagc 


accaccccct 


54 0 


cagaggtgct 


gtggatctct 


gtgccagtcc 


cacaatcttt gaagagtcag 


gcttcaaggc 


C f\f\ 


cgccactccc 


cactgtccct 


caccccaggc 


ccacccagcg ggcctctgct 


ggccactcag 


oo U 


tcccgggatt 


ctctgagtgc 


tcaaggggcc 


tcagggaagc cactcactca 


tccattcact 


Ton 


cagcaaacat 


ttgccaaggc 


cgtgtcctca 


ccaggctcgc ctggcactgg gggtataaag 


•7 Q ft 


aagaggccag 


gtttcagctg 


agtgcgtata 


cactgtggtg ggggctggtg cagaggcaga 


840 


tagtggcagt 


tcactttgcc 


atctgcatgg 


atggggacac atgcaggtca 


ttgtgcgtag 


900 


gcacctgccc 


agaggtgagg 


tgagagggat 


gtgcttcagg gaagtcttcc 


tagaggaggc 


o f n 


aatgtctgag 


ctaagtctta 


aaggatgaag 


gacaactggc caagtggaga 


caaagggagg 


1020 


aggggattac 


aggttgaggg 


cccaacatga 


gcaaagacag gagatgcaga 


tggctggggt 


1080 


aatttgtcac 


agtagcaaca 


ggaaatcagt 


acaagccctg gacctggccc attcttcatg 


1140 


tccccttccc 


aagcctccaa 


gcccacatgg 


gcacttgcca agatcagagc 


tccaggggcc 


1200 
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tccagggacg gggttttcag tcctctggga ccatgatcca cagagagaaa ttgatcctac 1260 
ttgagacaca ggaagcaaca catgtaatgc aacagccaag cacccaacga tcatctaatg 1320 
caacagccaa acacccagcg atcatctaat gcaacagcca aacacccagc gatcatctaa 1380 
tgcaacagcc aaacacccag cgatcatcta atgcaacagc caaacaccca gcgatcatct 1440 
aacgcaacag ccaaacaccc agtgatcatc taatgcaaca gccaaacacc cagtgatcat 1500 
ctaatgcaac agccaaacag acaagtgatc atctaatgca acagcca 1547 

<210> 87 
<211> 2345 
<212> DNA 
<213> Homo sapien 

<400> 87 

cacgctcgag cggcgcagtg tgacggatcg agcggccgcc cgggcaggta cgtgctccgg 60 
gatcttcagc acccgcggcc gccatcgccg tcgcttggct tcttctggac tcatctgcgc 120 
cacttgtccg cttcacactc cgccgccatc atggtgaagc tcgcgaaggc aggtaaaaat 180 
caaggtgacc ccaagaaaat ggctcctcct ccaaaggagg tagaagaaga tagtgaagat 24 0 
gaggaaatgt cagaagatga agaagatgat agcagtggag aagaggtcgt catacctcag 300 
aagaaaggca agaaggctgc tgcaacctca gcaaagaagg tggtcgtttc cccaacaaaa 360 
aaggttgcag ttgccacacc agccaagaaa gcagctgtca ctccaggcaa aaaggcagca 420 
gcaacacctg ccaagaagac agttacacca gccaaagcag ttaccacacc tggcaagaag 480 
ggagccacac caggcaaagc attggtagca actcctggta agaagggtgc tgccatccca 54 0 
gccaaggggg caaagaatgg caagaatgcc aagaaggaag acagtgatga agaggaggat 600 
gatgacagtg aggaggatga ggaggatgac gaggacgagg atgaggatga agacgaaatt 660 
gaaccagcag cgatgaaagc agcagctgct gcccctgcct cagaggatga ggacgatgag 720 
gatgacgaag atgatgagga tgacgatgac gatgaggaag atgactctga agaagaagct 780 
atggagacta caccagccaa aggaaagaaa gctgcaaaag ttgttcctgt gaaagccaag 84 0 
aacgtggctg aggatgaaga tgaagaagag gatgatgagg acgaggatga cgacgacgac 900 
gaagatgatg aagatgatga tgatgaagat gatgaggagg aggaagaaga ggaggaggaa 960 

gagcctgtca aagaagcacc tggaaaacga aagaaggaaa tggccaaaca gaaagcagct 1020 

cctgaagcca agaaacagaa agtggaaggc acagaaccga ctacggcttt caatctcttt 1080 

gttggaaacc taaactttaa caaatctgct cctgaattaa aaactggtat cagcgatgtt 114 0 

tttgctaaaa atgatcttgc tgttgtggat gtcagaattg gtatgactag gaaatttggt 1200 

tacgtggatt ttgaatctgc tgaagacctg gagaaagcgt tggaactcac tggtttgaaa 1260 
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gtctttggca atgaaactaa actagagaaa 
gatgcgagaa cacttttggc taaaaatctc 
gaagtgtttg aagatgctgc ggagatcaga 
attgcttata ttgaatctaa gacagaagct 
ggaacagaga tcgatgggcg atctatttcc 
caagactata gaggtggaaa gaatagcact 
agcaacctct cctacagtgc aacagaagaa 
tttatcaaag taccccagaa ccaaaatggc 
gcctcactcg aagacgctaa agaagcttta 
agagcaatca ggctggaggc gaggaggctt 
caagccacaa ggaaagaaga cgaagtttga 
catttgaaag aaaggactcc ggggttttta 
aggacattcc aagacagtat acagtcctgt 
tccaagggca ataccgtgtt ggttttgacc 
agtgatagag ctaaccctta tctgtaagtt 
aaaccatttt ttcctacaaa tagtttgggt 
tttgtttttt ttttttttgc gttcgtgggg 
tcatggtttt tgcttcagcg gctttaggac 
aaaaa 

<210> 88 

<211> 1716 

<212> DNA 

<213> Homo sapien 

<400> 88 

catgctcgag cggcgcagtg tgatggatcg 
gatcttcagc acccgcggcc gccatcgccg 
cacttgtccg cttcacactc cgccgccatc 
caaggtgacc ccaagaaaat ggctcctcct 
gaggaaatgt cagaagatga agaagatgat 
aagaaaggca agaaggctgc tgcaacctca 
aaggttgcag ttgccacacc agccaagaaa 
gcaacacctg ccaagaagac agttacacca 
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(.^clctciaggaa 


aarta/^acF^aa 
oiciyci(»ayuaa 




1320 


t» WU UOIViipClClCl^ 


f* r*a ot" r'aacfa 


taaattiaaaa 


1380 


f" t" art t" csnf*si 


anrra t* oaaaa 

ayya^yyy*** 


oay wt»taoy*^v| 


1440 




oaa^cc c uyo 




1500 




\m uyyay ay aa 


ayy c^aBMCiw 


1560 


tggagtggcg 


aah^aaaaar* 


t rraattt ta 
\m *-yy •» w w i»a 


1620 




aaa^ar ^ t'aa 
aayL>awk 


yaaayv^aww^ 


1680 




aa hat'ar*a^^ 


f- a r aaa at* t* t 
uo L.ay ciy u i« 


1740 




a taaaaggga 


aat tgagggc 


1800 


^N»y oyy wyy w 




aacrotaacca 

*ja*j*j w«gvi v« wu 


1860 


o cay w w u w L>y 


Li^w^c^b>y^i« i» 


ttcccttt tc 


1920 


^uyu uovvWiUy 


a ^aa I" aa^a 
a w v«aa i.y 


yayixv^uuwuy 


1980 


yyuwUwwuuy 


aaaa h f oah r* 
yaaaLvw^yc^ 


taat taaeat 
uay L> L>aa^a c 


2040 


aask t" a t" t" f*a t" 


awBaav>i»\^«vW 


taaaaaat to 


2100 


^^aaa^■^^a^ 
uuyoovwuciu 


a^t■ot•^ teat 


occatataca 

v» wvvO vy wGi%iPa 


2160 


tttgttgttg 


tttctttttt 


ttgttttgtt 


2220 


ttgtaaaaga 


aaagaaagca 


gaatgtttta 


2280 


aaattaaaag 


tcaactctgg 


tgccaaaaaa 


2340 








2345 


ay uy y V. v»y w V 




wy uy ^ u v»y y 


60 


uwy (..u byy u 


t 1 1 1^ t aaa ^? 
u^utwuyyaw 


tea t ctaccfC 


120 


a uyy uyaay C 


^ /^rtoaa a aa^ 
t Cy Cy aayy c 


ayy uaaaaai> 


180 


w waaayy ayy 


t aaa a aaaaa 
uayaciyaay a 


taat aaaaa t 


240 


agcagtggag 


aagaggtcgt 


catacctcag 


300 


gcaaagaagg 


tggtcgtttc 


cccaacaaaa 


360 


gcagctgtca 


ctccaggcaa 


aaaggcagca 


420 


gccaaagcag 


ttaccacacc 


tggcaagaag 


480 



wo 2004/050860 PCT/US2003/040063 

120 

ggagccacac caggcaaagc attggtagca actcctggta agaagggtgc tgccatccca 540 

gccaaggggg caaagaatgg caagaatgcc aagaaggaag acagtgatga agaggaggat 600 

gatgacagtg aggaggatga ggaggatgac gaggacgagg atgaggatga agatgaaatt 660 

gaaccagcag cgatgaaagc agcagctgct gcccctgcct cagaggatga ggacgatgag 720 

gatgacgaag atgatgagga tgacgatgac gatgaggaag atgactctga agaagaagct 780 

atggagacta caccagccaa aggaaagaaa gctgcaaaag ttgttcctgt gaaagccaag 840 

aacgtggctg aggatgaaga tgaagaagag gatgatgagg acgaggatga cgacgacgac 900 

gaagatgacg aagatgatga tgatgaagac gatgaggagg aggaagaaga ggaggaggaa 960 

gagcctgtca aagaagcacc tggaaaacga aagaaggaaa tggccaaaca gaaagcagct 1020 

cctgaagcca agaaacagaa agtggaaggc acagaaccga ctacggcttt caatctcttc 1080 

gttggaaacc taaactttaa caaatctgct cctgaattaa aaactggtat cagcgatgtt 1140 

tttgctaaaa atgatcttgc tgttgtggat gtcagaattg gtatgactag gaaatttggt 1200 

tatgtggatt ttgaatctgc cgaagacctg gagaaagcgt tggaactcac tggtttgaaa 1260 

gtctttggca atgaaatcaa actagagaaa ccaaaaggaa aagacagtaa gaaagagcga 1320 

gatgcgagaa cacttttggc taaaaatctc ccttacaaag tcactcagga tgaattgaaa 1380 

gaagtgtttg aagatgctgc ggagatcaga ttagccagca aggatgggaa aagtaaaggg 1440 

attgcttata ttgaatttaa gacagaagct gatgcagaga aaacctttga agaaaagcag 1500 

ggaacagaga tcgatgggcg atctatctcc ctgtactata ctggagagaa aggtcaaaat 1560 

caagactata gaggtggaaa gaatagcact tggagtggtg aatcaaaaac tctggtttta 1620 

agcaacctct cctacagtgc aacagaagaa actcttcagg aagtatttga gaaagcaact 1680 

tttatcaaag tacctcggcc gcgaccacgc taagcc 1716 

<210> 89 
<211> 2068 
<212> DNA 
<213> Homo sapien 

<400> 89 

gcagattgca gggcccgggc tgacgggaag tgggtgggag ctgcctgcac acgcggtgcc 60 

gcggggcggg agtagaggcg gagggagggg acacgggctc attgcggtgt gcgccctgca 120 

ctctgtccct cactcgccgc cgacgacctg tctcgccgag cgcacgcctt gccgccgccc 180 

cgcagaaatg cttcggttac ccacagtctt tcgccagatg agaccggtgt ccagggtact 240 

ggctcctcat ctcactcggg cttatgccaa agatgtaaaa tttggtgcag atgcccgagc 300 

cttaatgctt caaggtgtag accttttagc cgatgctgtg gccgttacaa tggggccaaa 360 
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gggaagaaca 


gtgattattg 


agcagagctg 


gactgttgca 


aagtcaattg 


acttaaaaga 


tcaagatgtt 


gccaataaca 


caaatgaaga 


actggcacgc 


tctatagcca 


aggaaggctt 


ggaaatcagg 


agaggtgtga 


tgttagctgt 


gtctaaacct 


gtgaccaccc 


ctgaagaaat 


agacaaagaa 


attggcaata 


tcatctctga 


catcacagta 


aaggatggaa 


aaacactgaa 


gtttgatcga 


ggctatattt 


ctccatactt 


attccaggat 


gcctatgttc 


tgttgagtga 


acctgctctt 


gaaattgcca 


atgctcaccg 


tgatggagaa 


gctctaagta 


cactcgtctt 


ggcagtcaag 


gctccagggt 


ttggtgacaa 


tgctactggt 


ggtgcagcgt 


ttggagaaga 


gcctcatgac 


ttaggaaaag 


ttggagaggt 


aaaaggaaaa 


ggtgacaagg 


ctcaaattga 


agatgtcaca 


actagtgaat 


atgaaaagga 


agatggagtg 


gctgtgctga 


aggttggtgg 


agacagagtt 


acagatgccc 


ttaatgctac 


gggagggggt 


tgtgccctcc 


ttcgatgcat 


tgaagatcaa 


aaaattggta 


tagaaattat 


cattgctaag 


aatgcaggtg 


ttgaaggatc 


ctcagaagtt 


ggttatgatg 


ctatggctgg 


cattgaccca 


acaaaggtaa 


atggttgaac 


tgt taaCgta 


tcct cacat c 


ccaaatctct 


cagtagcaac 


agcaatagct 


cttgtgccca 


tagatattta 


atactccact 


aaaaggaagc 


cctggaggca 


ggaaaaaatc 


aggatggatc 


aatattttcc 


aagatacgca 


gaacaggg 
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gggaagtccc aaagtaacaa aagatggtgt 420 

taaatacaag aacattggag ctaaacttgt 4 80 

agctggggat ggcactacca ctgctactgt 540 

cgagaagatt agcaaaggtg ctaatccagt 600 

tgatgctgta attgctgaac ttaaaaagca 660 

tgcacaggtt gctacgattt ctgcaaacgg 720 

tgcaatgaaa aaagttggaa gaaagggtgt 780 

tgatgaacta gaaattattg aaggcatgaa 840 

tattaataca tcaaaaggtc agaaatgtga 900 

aaagaaaatt tctagtaccc agtccattgt 960 

taagcctttg gtcataatcg ctgaagatgt 1020 

gaataggcta aaggttggtc ttcaggttgt 1080 

tagaaagaac cagcttaaag atatggctat 114 0 

gggattgacc ctgaatcttg aagacgttca 1200 

cattgtgacc aaagacgatg ccatgctctt 1260 

aaaacgtatt caagaaatca ttgagcagtt 1320 

aaaactgaat gaacggcttg caaaactttc 1380 

gacaagtgat gttgaagtga atgaaaagaa 1440 

aagagctgct gttgaagaag gcattgtttt 1500 

tccagccttg gactcattga ctccagctaa 1560 

taaaagaaca ctcaaaattc cagcaatgac 1620 

cttgacagct gagaaaatta tgcaaagttc 1680 

agattttgcg aatatggtgg aaaaaggaat 1740 

actttctaaa aattcttttg ttgttcattt 1800 

gattttgtca ccagaaaaat agtctaatag 1860 

tcaacaattc tcatgtacat ttccagcacc 1920 

aggatccctt gagtagcagt tgagtccatg 1980 

tggaacaccc tgttcctgcc tggcagtaag 2040 

2066 



<210> 90 
<211> 366 
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<212> DNA 

<213> Homo sapien 



<400> 90 

gccagtgtga 


tggattcgcg 


gcgaggtgtc 


tactgttcac 


gaaatcctgt 


gcaagctcag 


60 


cttggagggt 


gatcactcta 


cacccccaag 


tgcatatggg 


tctgtcaaag 


cctatactaa 


120 


ctttgatgct 


gagcgggatg 


ctttgaacat 


tgaaacagcc 


atcaagacca 


aagaggtgtg 


180 


gatgaggtca 


ccattgtcaa 


cattttgacc 


aaccgcagca 


atgcacagag 


acaggatatt 


240 


gccttcgcct 


accagagaag 


gaccaaaaag 


gaacttgcat 


cagcactgaa 


gtcagcctta 


300 


tctggccacc 


tggagacggt 


gact ccgggc 


ctat tgaaga 


cacctgctca 


gcatgacgct 




tctgag 












.3 O Q 


<210> 91 

<211> 1346 

<212> DNA 

<213> Homo sapien 












<400> 91 
catgctcgag 


cggcgcagtg 


tgatggatcg 


tggtcgcggc 


cgagggacgc 


tctcagctct 


60 


cggcgcacgg 


cccagcLr ucc 


ttcaaaatgt 


ctactgttca 


cgaaatcctg 


tgcaagctca 


120 


gcttggaggg 


tgatcactct 


acacccccaa 


gtgcatatgg 


gtctgtcaaa 


gcctatacta 


180 


actttgatgc 


tgagcgggat 


gctttgaaca 


ttgaaacagc 


catcaagacc 


aaaggtgtgg 


240 


atgaggtcac 


cattgtcaac 


attttgacca 


accgcagcaa 


tgcacagaga 


caggatattg 


300 


ccctcgccta 


ccagagaagg 


accaaaaagg 


aacttgcatc 


agcactgaag 


tcagccttat 


360 


ctggccacct 


ggagacggtg 


attttgggcc 


tattgaagac 


acctgctcag 


tatgacgctt 


420 


ctgagctctg 


ctccagaacc 


aaccaggagc 


tgcaggaaat 


taacagagtc 


tacaaggaaa 


480 


tgtacaagac 


tgatctggag 


aaggacatta 


tttcggacac 


atctggtgac 


ttccgcaagc 


540 


tgatggttgc 


cctggcaaag 


ggtagaagag 


cagaggatgg 


ctctgtcatt 


gattatgaac 


600 


tgattgacca 


agatgctcgg 


gatctctatg acgctggagt 


gaagaggaaa 


ggaactgatg 


660 


ttcccaagtg 


gatcagcatc 


atgaccgagc 


ggagcgtgcc 


ccacctccag 


aaagtatttg 


720 


ataggtacaa 


gagttacagc 


ccttatgaca 


tgttggaaag 


catcaggaaa 


gaggttaaag 


780 


gagacctgga 


aaatgctttc 


ctgaacctgg 


ttcagtgcat 


tcagaacaag 


cccctgtatt 


840 


ttgctgatcg 


gctgtatgac 


tccatgaagg gcaaggggac 


gcgagataag gtcctgatca 


900 


gaatcatggt 


ctcccgcagt 


gaagtggaca 


tgttgaaaat 


taggtctgaa 


ttcaagagaa 


960 


agtacggcaa 


gtccctgtac 


tattatatcc 


agcaagacac 


taagggcgac 


taccagaaag 


1020 


cgctgctgta 


cctgtgtggt 


ggagatgact gaagcccgac 


acggcctgag 


cgtccagaaa 


1080 
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tggtgctcac 


catgcttcca 


gctaacaggc 


cgtggccatc 


cctgtgaggg 


tgacgttagc 


gcattgcctg 


gccttcctgt 


ctagtctctc 


agttggaagt 


gaagtctatg 


atgtgaaaca 


gatgaataaa 


ctgaatttgt 


acctta 


<210> 92 

<211> 756 

<212> DNA 

<213> Homo sapien 




<400> 92 
ccgaggcccc 


tccttgctcg 


acgccgatat 


ctctcgcgcg 


cggtgtggtg 


gcagcaggcg 


acgacgccgg 


cgagttcgtg 


gacctgtacg 


tcatcggtgc 


caaggaccac 


gcatccatcc 


caggcaggtt 


taatggccag 


tttaaaactt 


gtgagtcaga 


tgattccatt 


ctccgactgg 


gttgggggct 


cacatttggg 


cagagtgagt 


aacgttgtcc 


ttttcccctg 


gttctaggaa 


atatttgtca 


taaataaata 


atgaaaacct 


cttctctcaa 


ataataacca 


tacacaacac 


atccttagtc 


atttcttttg 


tgcgcacaat 


acaccaacca 


cccaatatct 


ttaaacctag 


taaaggagtc 


ggcctaagac 


aatatggcct 



<210> 93 

<211> 1420 

<212> DNA 

<213> Homo sapien 

<400> 93 

ccgaggcccc tccttgctcg acgccgatat 
ctctcgcgcg cggtgtggtg gcagcaggcg 
acgacgccgg cgagttcgtg gacctgtacg 
tcatcggtgc caaggaccac gcatccatcc 
caggcaggtt taatggccag tttaaaactt 
tgagtgtttc cctgggcttt gctcatcact 



123 








ctagaaaacc 


agcctgcgaa 


taacagtccc 


1140 


attaccccca 


acctcatttt 


agttgcctaa 


1200 


ctgtaagcca 


aagaaatgaa 


cattccaagg 


1260 


ctttgcctcc 


tgtgtactgt 


gtcataaaca 


1320 








1346 


ctctgccggg 


tgactagctg 


cttcctttct 


60 


cagcaggcgc 


acccagcctc 


gaaatgcaga 


120 


tgccgcggaa 


atgctccgct 


agcaatcgca 


180 


agatgaacgt 


ggccgaggtt 


gacaaggtca 


240 


atgctatctg 


cggggccatt 


cgtaggatgg 


300 


ccaaggccga 


tggcatcgtc 


tcaaagtaag 


360 


ggactaggac 


tgctccagag 


gcgtggtctt 


420 


cttttgaccg 


gagagaatca 


cagatgtgga 


480 


aaaaaaaaaa 


aaaaaaaaac 


tcgagactag 


540 


taaggggcga 


acctgatctc 


ttatacaagt 


600 


taacctcctc 


ggactccggc 


tcactcattt 


660 


catgggcatc 


ctcttatgag 


gggggcggga 


720 


agccat 






756 


ctctgccggg 


tgactagctg 


cttcctttct 


60 


cagcaggcgc 


acccagcctc 


gaaatgcaga 


120 


tgccgcggaa 


atgctccgct 


agcaatcgca 


180 


agatgaacgt 


ggccgaggtt 


gacaaggtca 


240 


atgctatctg 


cggggccatt 


cgtaggatgg 


300 


tcgggacatc 


gtggacttta 


ccgtgcgcat 


360 



900. 
960 
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tggagtgtgc gatggtgcct gagtagatct gctggcagag tagtttgagc cagctggact 420 

gggctggccg cctgccgctt cttgagggtg gaagaggggt gctctgagaa gacactcagg 4 80 

cagcagactc tgcctctcac taggaggtgc ccccccgacc ccgctccacc atagtcaggc 540 

tgcaggctgc cccgggagag gtggctcccc ttctgcgcct gtctccattc gctcagcggg 600 

ggagagacgt gggctggtgg cacagctgac cttctgccat ctcaggcagc cggagtggaa 660 

atattcttag tgtgcttttt ttttcttctt aagggtgagt cagatgattc cattctccga 720 

ttggccaagg ccgatggcat cgtctcaaag taaggttggg ggctcacatt tgggcagagt 780 

gagtggacta ggactgctcc agaggcgtgg tcttaacgtt gtccttttcc cctggttcta 84 0 
ggaacttttg actggagaga atcacagatg tggaatattt gtcataaata aataatgaaa 
acctaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 

aaaaaaaaaa aaaagagggg ggggcgcgcc caaaaaatcc ccccgggggg cgcgcctttg 1020 

cgcccccgct tttgtgtgaa aggggggccc ccatgagggg ctttttaaag ggccgcgcag 1080 

cggggcgcgc gttataaaaa gcaccagcag cgagtggggg aacacagccg agccacgcgg 114 0 

gggagatctc ttggggagaa ggagccccca tatcggcggg gggggggaga gcaaaattag 1200 

ggggggaaac accctacacc taaacgaaga tacattaaga aactcttggg aggggaagta 1260 

atatataaac ttttcagaga gggggtatat aggtgggtga aaaaccagag acgcgagatc 1320 

gtatggatgt ggggtggtaa aagaatattg tgggtctagc gagtgtatgt aatttcgacg 1380 

aaactttatt atagcgaggg gcgttttaga tgataaggtg 1420 

<210> 94 
<211> 1536 
<212> DNA 
<213> Homo sapien 

<400> 94 

ggcacgaggc atcgcgcgcg gtgtggtggc agcaggcgca gcaggcgcac ccagcctcga 60 

aatgcagaac gacgccggcg agttcgtgga cctgtacgtg ccgcggaaat gctccgctag 120 

caatcgcatc atcggtgcca aggaccacgc atccatccag atgaacgtgg ccgaggtgag 180 

ctgggagccc gggaggcggg aaggttgtga tatatgtgcg ggaaaggcag gctgtcccat 240 

tgtggaggag cccctggggt gaaggtacag gcagaggctg gctttgagga ttggtgtttc 300 

ccaaacctgg gggagtggtt tgtgaccctt cctctctttc taggttgaca aggtcacagg 360 

caggtttaat ggccagttta aaacttatgc tatctgcggg gccattcgta ggatggtgag 420 

tgtttccctg ggccttgctc atcacttcgg gacatcgtgg actttaccgt gcgcattgga 480 

gtgtgtgatg gtgcctgagt agatctgctg gcagagtagt ttgagccagc tggactgggc 540 
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tggccgcctg 


ccgcttcttg 


agggtggaag 


aggggtgctc 


tgagaagaca 


ctcaggcagc 


600 


agactctgcc 


tctcactagg 


aggtgccccc 


ccgaccccgc 


tccaccatag 


tcaggctgca 


660 


ggctgccccg 


ggagaggtgg 


ctccccttct 


gcgcctgtct 


ccattcgctc 


agcgggggag 


720 


agacgcgggc 


tggtggcaca 


gctgaccttc 


tgccatctca 


ggcagccgga 


gtggaaacat 


780 


tcttagtgtg 


cttttttttt 


tttcttaagg 


gtgagtcaga 


tgattccatt 


ctccgattgg 


840 


ccaaggccga 


tggcatcgtc 


tcaaagtaag 


gttgggggct 


cacatttggg 


cagagtgagt 


900 


ggactaggac 


tgctccagag 


gcgtggtctt 


aacgttgtcc 


ttttcccctg gttctaggaa 


960 


cttttgactg 


gagagaatca 


cagatgtgga 


atattcgtca 


taaataaata atgaaaacct 


1020 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


1080 


aaaaaaaaaa 


gagggggggg 


cgcgcccaaa 


aaatcccccc 


ggggggcgcg 


cctttgcgcc 


1140 


cccgcttttg 


tgtgaaaggg 


gggcccccat 


gaggggcttt 


ttaaagggcc 


gcgcagcggg 


1200 


gcgcgcgtta 


taaaaagcac 


cagcagcgag 


cgggggaaca 


cagccgagcc 


acgcggggga 


1260 


gatctcttgg 


ggagaaggag 


cccccatatc 


ggcggggggg 


gggagagcaa 


aattaggggg 


1320 


ggaaacaccc 


tacacctaaa 


cgaagatata 


ttaagaaact 


cttgggaggg gaagtaatat 


1380 


ataaactttt 


cagagagggg 


gtatataggt 


gggtgaaaaa 


ccagagacgc gagatcgtat 


1440 


ggatgcgggg 


tggtaaaaga 


atactgtggg 


tctagcgagt 


gtatgtaatt 


tcgacgaaac 


1500 


tttattatag 


cgaggggcgt 


tttagatgat 


aaggtg 






X ^ J o 


<210> 95 

<211> 930 

<212> DNA 

<213> Homo sapien 












<400> 95 

agatcatgcc gagcggcgcc 


agtgtgatgg 


atgcgtggtc gcggccgagg 


tacgtgccgc 


60 


ggaaacgctc 


cgctagcaat 


cgcatcatcg 


gtgccaagga 


ccacgcatcc 


atccagatga 


120 


acgtggccga 


ggttgacaag 


gtcacaggca 


ggtttaatgg 


ccagtttaaa 


acttatgcta 


180 


tctgcggggc 


cattcgtagg 


atgggtgagt 


cagatgattc 


cattctccga 


ttggccaagg 


240 


ccgatggcat 


cgtctcaaag 


taaggttggg 


ggctcacatt 


tgggcagagt 


gagtggacta 


300 


ggactgctcc 


agaggcgtgg 


tcttaacgtt 


gcccttttcc 


cctggttcta 


ggaacttttg 


360 


actggagaga 


atcacagatg 


tggaatattt 


gtcataaata 


aataatgaaa 


acctaaaaaa 


420 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


480 


aaaagagggg 


ggggcgcgcc 


caaaaaatcc 


ccccgggggg 


cgcgcctttg 


cgcccccgct 


540 


tttgtgtgaa 


aggggggccc 


ccacgagggg 


ctttttaaag 


ggccgcgcag 


cggggcgcgc 


600 
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gttataaaaa 


gcaccagcag 


cgagtggggg 


aacacagccg 


agccacgcgg 


gggagatctc 


660 


ttggggagaa 


ggagccccca 


tatcggcggg gggggggaga gcaaaattag ggggggaaac 


720 


accccacacc 


caaacgaaga 


tatattaaga 


aactcttggg 


aggggaagta 


atatataaac 


780 


ttttcagaga 


gsgggtatat 


aggtgggcga 


aaaaccagag 


acgcgagatc 


gtatggatgt 


840 


ggggtggtaa 


aagaatattg 


tgggtctagc 


gagtgcatgt 


aatttcgacg 


aaactttatt 


900 


atagcgaggg 


gcgttttaga 


tgataaggtg 








930 



<210> 96 

<211> 185 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE 

<222> (35) . . (35) 

<223> X=any amino acid 



<220> 

< 2 2 1 > M I SC_FE ATURE 

<222> (70).. (70) 

<223> X=any amino acid 



<220> 

< 22 1 > MISC_FEATURE 

<222> (81).. (81) 

<223> X=any amino acid 



<220> 

<221> MISC^FEATURE 

<222> (93).. (93) 

<223> Xsany amino acid 



<400> 96 

Gin Lys Ser lie His Ala Cys Asn Val Gly Gly Arg Leu Leu Cys Gin 
15 10 15 



Asp Arg Pro Pro Thr Leu Gin Lys Ser lie His Ala Cys Ala Ala Arg 
20 25 30 



lie Ala Xaa Ser Ser Gly His Arg Pro Gly Thr Phe Ser Arg Val Thr 
35 40 45 



Ala Leu Asn Asp Val Glu Thr Arg Asp Ser Thr Trp Pro His Ala Arg 
50 55 60 
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Cys Glu Gly Pro Ala Xaa Ser Arg Asp Val Trp Thr Pro Ala Gly Cys 
65 70 75 80 



Xaa Gin Glu Ala Val Glu Leu Val Gin Tyr Ala Tyr Xaa Ser Glu Lys 
85 90 95 



Val Arg Gly Glu Arg Arg Arg Thr Arg Lys Glu Ala Asn Val Lys Asp 
100 105 110 



Glu Val Lys Asp Arg Gin lie Asp Arg Gly Glu Thr Ala Lys Arg Thr 
115 120 125 



Leu Glu Gin Lys Arg Lys Arg Arg Lys Thr Arg Gin Pro Asp Ala Lys 
130 135 140 



Asp Gly Asp Ser Tyr Asp Pro Tyr Asp Phe Ser Asp Thr Glu Glu Glu 
145 150 155 160 



Met Pro Gin Val His Thr Pro Lys Thr Ala Asp Ser Gin Glu Thr Lys 
165 170 175 



Glu Ser Gin Lys Val Glu Leu Ser Glu 
180 185 



<210> 97 

<211> 109 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE 

<222> (11) . . (11) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (39).. (39) 

<223> Xssany amino acid 



<220> 

<221> MISC^FEATURE 

<222> (41).. (42) 

<223> Xsany amino acid 



<220> 

<221> MISC_FEATURE 

<222> (44).. (44) 

<223> Xsany amino acid 
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<:220> 

<221> MISC_FEATURE 

<222> (55).. (57) 

<223> X=any amino acid 



<220> 

<221> MISC^FEATURE 

<222> (77) . . (77) 

<223> X=any amino acid 



<400> 97 

Ala Glu Thr Cys Gly Pro Leu Gin Asp Ala Xaa Arg Lys Leu Trp Ser 
15 10 15 



Trp Ser Ser Met Leu Thr Phe Arg Glu Gly Ser Trp Arg Thr Glu Lys 
20 25 30 



Lys Arg Lys Lys Arg Ser Xaa Gly Xaa Xaa Gin Xaa Gin Lys Met Lys 
35 40 45 



Arg Arg Lys Ala Lys Arg Xaa Xaa Xaa Arg Arg Gly Arg Glu Gly Arg 
50 55 60 



Leu Ala Ser Gin Met Pro Lys Met Gly lie His Thr Xaa Pro Met Thr 
65 70 75 80 



Ser Val Thr Gin Arg Arg Lys Cys Leu Lys Tyr Thr Leu Gin Arg Arg 
85 90 95 



Gin Thr His Arg Arg Pro Arg Asn Pro Arg Lys Trp Ser 
100 105 



<:210> 98 

<211> 106 

<212> PRT 

<213> Homo sapien 

<400> 98 

Pro Gly Leu lie Pro Leu Glu Asp Lys Glu Asp Tyr Gly Pro Asn Lys 
1.5 10 15 



Glu Cys Pro Leu Cys Leu Cys Pro Arg Leu Phe Glu Ser Leu Ser Arg 
20 25 30 



Asp Leu Lys Lys Asp Tyr Gly Val Tyr Leu Glu Asp Ser Gly Thr His 
35 40 45 
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Cys Leu Glu Val Ser Val Gin lie Phe lie Asp Asp Lys Gly He Leu 
50 55 60 



Arg Gin He Thr Leu Asn Asp Leu Pro Val Gly Arg Ser Val Asp Glu 
65 70 75 80 



Thr Leu Arg Leu Val Gin Ala Phe Gin Tyr Thr Asp Lys His Gly Glu 
85 90 95 



Val Cys Pro Ala Gly Trp Lys Pro Gly Lys 
100 105 



<210> 99 

<211> 75 

<212> PRT 

<213> Homo sapien 

<400> 99 

He Pro Lys Ser Arg Ser Gin Lys Asp Tyr Gly Val Tyr Leu Glu Asp 
15 10 15 



Ser Gly His Thr Leu Arg Gly Leu Phe lie He Asp Asp Lys Gly He 
20 25 30 



Leu Arg Gin He Thr Leu Asn Asp Leu Pro Val Gly Arg Ser Val Asp 
35 40 45 



Glu Thr Leu Arg Leu Val Gin Ala Phe Gin Tyr Thr Asp Lys His Gly 
50 55 60 



Glu Val Cys Pro Ala Gly Trp Lys Pro Gly Lys 
70 75 



65 




<210> 


100 


<211> 


224 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


100 



Met Ser Tyr Leu Lys Arg Leu Cys Gly Thr Phe Leu Gly Gly Pro Lys 
1 5 10 15 



Pro Pro Gin Arg Val Met Phe Thr Glu Asp Leu Lys Leu Pro Ala Ser 
20 25 30 



Phe Asp Ala Arg Glu Gin Trp Pro Gin Cys Pro Thr He Lys Glu He 
35 40 45 
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Arg Asp Gin Gly Ser Cys Gly Ser Cys Trp Ala Phe Gly Ala Val Glu 
50 55 60 



Ala lie Ser Asp Arg lie Cys He Gin His Gin Cys Ala Arg Arg Ala 
65 70 75 80 



Trp Arg Cys Arg Arg Arg Thr Cys Ser His Ala Val Ala Ala Cys Val 
85 90 95 



Gly Thr Ala Val Met Val Ala He Leu Leu Lys Leu Gly Thr Ser Gly 
100 105 110 



Gin Glu Lys Ala Trp Phe Leu Val Ala He Tyr Glu Ser His Val Gly 
115 120 125 



Cys Arg Pro Tyr Phe His Thr Leu Pro Val Ser Thr Thr Ser Lys Gly 
130 135 140 



Ser Arg Pro Pro Cys Thr Gly Glu Gly Asp Thr Pro Lys Cys Ser Lys 
145 150 155 160 



Ser Cys Glu Pro Gly Tyr Arg Pro Ser Tyr Lys Gin Asp Lys Arg Tyr 
165 170 175 



Gly Tyr Asn Ser Tyr Ser Val Ser Asn Ser Glu Lys Asp He Met Ala 
180 185 190 



Glu He Ser Lys Asn Gly Pro Trp Arg Glu Leu Ser Leu Cys He Gly 
195 200 205 



Leu Pro Gly Leu Glu Val Arg Glu Cys Thr Asn Thr Ser Pro Glu Arg 
210 215 220 



<210> 101 

<211> 181 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE 

<222> (16) , . (16) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (19) . . (20) 

<223> X=any amino acid 
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<220> 

<221> MISC_FEATURE 

<222> (22).. (29) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (31).. (32) 

<223> X=any amino acid 



<220> 

< 2 2 1 > MI SC_FEATURE 

<222> (34)7.(34) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (37).. (37) 

<223> X=any amino acid 



<220> 

< 2 2 1 > MI S COFEATURE 

<222> (153) . . (153) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (172) . . (172) 

<223> X=any amino acid 



<400> 101 

Pro Leu Arg Gin Arg Gin Pro Leu Arg Cys Ala Gin Ala Gly Leu Xaa 
15 XO 15 



Ala Leu Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa 
20 25 30 



Ser Xaa Leu Ser Xaa Ser Leu Cys Cys Leu Leu Val Leu Ala Asn Ala 
35 40 45 



Arg Ser Arg Pro Ser Phe His Pro Leu Ser Asp Glu Leu Val Asn Tyr 
50 55 60 



Val Asn Lys Arg Asn Thr Thr Trp Gin Ala Gly His Asn Phe Tyr Asn 
65 70 75 80 



Val Asp Met Ser Tyr Leu Lys Arg Leu Cys Gly Thr Phe Leu Gly Gly 
85 90 95 
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Pro Lys Pro Pro Gin Arg Val Met Phe Thr Glu Asp Leu Lys Leu Pro 
100 105 110 

Ala Ser Phe Asp Ala Arg Glu Gin Trp Pro Gin Cys Pro Thr lie Lys 

115 120 125 

Glu lie Arg Asp Gin Gly Ser Cys Gly Ser Cys Trp Ala Phe Gly Ala 

130 135 140 



Val Glu Ala He Ser Asp Arg He Xaa He His Thr Asn Ala His Val 
145 150 155 160 



Glu Arg Gly Gly Val Gly Gly Gly Pro Ala His Xaa Leu Trp Gin His 
165 170 175 



Val Trp Gly Arg Leu 
180 



<210> 102 

<211> 227 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC_FEATURE 

<222> (45).. (45) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (78).. (78) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (85).. (85) 

<223> X=any amino acid 



<400> 102 

Thr Ser Leu His Glu Asp Asp Trp Arg Ser Arg Pro Ser Arg Gly Pro 
15 10 15 



Ala Leu Thr Pro He Arg Asp Glu Glu Trp Gly Gly His Ser Pro Arg 
20 25 30 



Ser Pro Arg Gly Trp Asp Gin Glu Pro Ala Arg Glu Xaa Ala Gly Gly 
35 40 45 
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Gly Trp Arg Ala Arg Arg Pro Arg Ala Arg Ser Asp Arg Arg His Trp 
50 55 60 



Thr Thr Ser Pro Arg Arg Ala Pro His Glu Ser Gly Ser Xaa Ser Pro 
65 70 75 80 



Thr Asn Asn Gly Xaa Arg Ser Arg Ala Tyr Met Pro Thr Val Asp Pro 
85 90 95 



His Val Arg Asp Asp Leu Leu Trp Thr Lys Tyr Asn Ser Arg Asp lie 
100 105 110 



Pro Thr Ala Thr Thr Gly Asp Pro Leu Leu Leu Tyr Asn lie Gin Ala 
115 120 125 



Leu Arg Asp Ala Ala Leu Leu Ser Tyr Pro Met Val Pro Thr His His 
130 135 140 



Ala Tyr Leu Gly Thr Leu Trp Asp Lys Arg Leu Pro Gly Ser Gly Asp 
145 150 155 160 



Leu Pro Tyr Asp Gly Arg Leu Leu Glu Glu Ala Val Arg Lys Lys Gly 
165 170 175 



Gly Arg Arg Arg Arg Arg lie Pro His Lys Glu Glu Glu Glu Glu Ala 
180 185 190 



Tyr Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp Ser Gin Ala 
195 200 205 



Ser Arg Glu Arg Arg Leu Lys Lys Asn Leu Ala Leu Ser Arg Glu Ser 
210 215 220 



Leu Val Val 



225 




<210> 


103 


<211> 


222 


<212> 


PRT 


<213> 


Homo sap i en 


<220> 




<221> 


MISC FEATURE 


<222> 


(10) . . (10) 


<223> 


X=any amino acid 
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<220> 

< 2 2 1 > MI SC_FE ATURE 

<222> (45).. (45) 

<223> Xsany amino acid 



<220> 

< 2 2 1 > MI SC_FEATURE 

<222> (59) . , (59) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (79).. (79) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (86).. (86) 

<223> X=any amino acid 



<400> 103 



Ser Pro Pro Ser Thr Arg Thr lie Gly Xaa Leu Gly Leu Pro Gly Ala 
15 10 15 



Leu Pro Ser Pro Arg Ser Gly Met Arg Ser Gly Val Ala Thr Pro Pro 
20 25 30 



Gly Val Pro Gly Asp Gly Thr Arg Ser Pro Pro Gly Xaa Arg Gin Ala 
35 40 45 



Gly Ala Gly Gly Pro Gly Gly Pro Gly Pro Xaa Pro lie Asp Ala Thr 
50 55 60 



Gly Arg Pro His Pro Ala.Glu His Arg Thr Ser Gin Gly Ala Xaa Leu 
65 70 75 80 



Pro Arg lie Met Val Xaa Glu Ala Gly His Tyr Met Pro Pro Gin Ser 
85 90 95 



Pro Ser Arg Asp Asp Leu Tyr Asp Gin Asp Asn Ser Arg Asp He Pro 
100 105 110 



Thr Leu Pro Gin Ala Thr Pro He Tyr Asp Asn He Gin Ala Pro Arg 
115 120 125 



Glu Arg Pro Pro Ala Tyr Pro Arg Ser His His His Arg Thr Arg Asp 
130 135 140 
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Pro Arg Asp Asn Gly Ser Arg Ser Gly Asp Leu Pro Tyr Asp Gly Arg 
145 150 155 160 



Leu Leu Glu Glu Ala Val Arg Lys Lys Gly Val Gly Gly Glu Glu Asp 
165 170 175 



Thr Pro Gin Gly Gly Gly Gly Arg Gly Leu Leu Pro Ala Arg Ala Ala 
180 185 190 



Pro Val Leu Gly Asp Arg Leu Ala Gly Val Pro Arg Ala Gin Ala Gin 
195 200 205 



Glu Glu Leu Gly Pro Glu Ser Gly Lys Phe Ser Arg Leu He 
210 215 220 



<210> 104 

<211> 74 

<212> PRT 

<213> Homo sapien 

<400> 104 

Met Arg Leu Gly Val Phe Val Arg Arg Leu Leu Cys Val Pro Gly Arg 
15 10 15 



Gly Asp Asp Val Val Leu Val Val Val Cys Leu Trp Glu Pro His Val 
20 25 30 



Gly Thr Ala Val Gly Lys Tyr Tyr Arg Arg Ala Lys Cys Gly Gly Pro 
35 40 45 



Ser Ser Leu Asp Gly He Cys Met Met Ser Ser Glu Gly Arg Asp Val 
50 55 60 



Cys Gly Gly Leu Arg Phe Leu Ser Cys He 
65 70 



<210> 105 

<211> 85 

<212> PRT 

<213> Homo sapien 

<400> 105 

Gly Val Cys Ser Gly Val Leu Leu Ala Trp Ser Asp Ala Ser Trp Ser 
15 10 15 



Phe Arg Glu Ala Pro Leu Cys Val Pro Gly Arg Gly Asp Asp Val Val 
20 25 30 
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Leu Val Val Val Cys Leu Trp Glu Pro His Val Gly Thr Ala Val Gly 
35 40 45 



Lys Tyr Tyr Arg Arg Ala Lys Cys Gly Gly Pro Ser Ser Leu Asp Gly 
50 55 60 



lie Cys Met Met Ser Ser Glu Gly Arg Asp Val Cys Gly Gly Leu Arg 
65 70 75 80 



Phe Leu Ser Cys lie 
85 



<210> 106 

<211> 85 

<212> PRT 

<213> Homo sapien 

<400> 106 

Gly Val Cys Ser Gly Val Leu Leu Ala Trp Ser Asp Ala Ser Trp Ser 
15 10 15 



Phe Arg Glu Ala Pro Leu Cys Val Pro Gly Arg Gly Asp Asp Val Val 
20 25 30 



Leu Val Val Val Cys Leu Trp Glu Pro His Val Gly Thr Ala Val Gly 
35 40 45 



Lys Tyr Tyr Arg Arg Ala Lys Cys Gly Gly Pro Ser Ser Leu Asp Gly 
50 55 60 



He Cys Met Met Ser Ser Glu Gly Arg Asp Val Cys Gly Gly Leu Arg 
65 70 75 80 



Phe Leu Ser Cys He 
85 



<210> 107 

<211> 66 

<212> PRT 

<213> Homo sapien 

<400> 107 

Thr Gly Arg Leu Tyr Ser Pro Pro Glu Cys Arg Gly Lys Ser Leu Thr 
15 10 15 



Ser Lys Gly Pro Thr Lys Gin Phe Arg Asn Leu Pro Pro Val Asn Val 
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20 25 30 



Pro Thr Thr Glu Val Ser Pro Thr Phe Ser Glu Asn His His Lys Asn 
35 40 45 



His His Thr Lys Cys Ser Ser Tyr Thr Glu Tyr Thr Cys Gin Gly Ser 
50 55 60 



Ser Arg 
65 



<210> 


108 


<211> 


66 


<212> 


PRT 


<213> 


Homo sapien 


<220> 




<221> 


MISC FEATURE 


<222> 


(54) . . (54) 


<223> 


X=any amino acid 


<400> 


108 



Thr Gly Arg Leu Tyr Ser Pro Pro Glu Cys Arg Gly Lys Ser Leu Thr 
15 10 15 



Ser Lys Gly Pro Thr Lys Gin Phe Arg Asn Leu Pro Pro Val Asn Val 
20 25 30 



Pro Thr Thr Glu Val Ser Pro Thr Ser Gin Lys Thr Thr Thr Lys Thr 
35 40 45 



Thr Thr Pro Asn Ala Xaa Ala Thr Arg Ser Thr Pro Ala Arg Asp Pro 
50 55 60 



Leu Glu 



65 




<210> 


109 


<211> 


126 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


109 



Met Trp His Leu Ser Pro Phe Ala Leu Gly He Cys Asp Pro Ser He 
1 5 10 15 



Val Leu Arg Pro Leu Cys Pro His Phe Pro Val His Val Gly Asp Asp 
20 25 30 
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Gly Ser Pro Phe Pro Phe Ala Gin Leu Pro Pro Gly Ala Arg Gly Pro 
35 40 45 



Ser Pro Gin Gly Val Trp lie Tyr Ser Phe He Arg Pro Gly Pro Pro 
50 55 60 



Met Phe Ala Cys Leu Cys Thr Ser Thr Pro Asn Val Ser Ala Leu Pro 
65 70 75 80 



Pro Glu Ala Leu Cys Arg Ala Ser Leu Phe .Trp Arg Gly Arg Gly Cys 
85 90 95 



Gly Val Thr Cys Thr Leu Gly Leu Val Asp Thr Val Asn Ser Ser Gin 
100 105 110 



Val Asp Phe Ser Gly Gly Glu Lys Lys Gly His Leu Arg Leu 

120 125 





115 


<210> 


110 


<211> 


117 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


110 


Leu Gly Pro Val Phe 


1 


5 



10 15 



Thr Cys Val Gly Met Trp His Leu Ser Pro Phe Ala Leu Gly He Cys 
20 25 30 



Asp Pro Ser He Val Leu Arg Pro Leu Cys Pro His Phe Pro Val His 
35 40 45 



Val Gly Asp Asp Gly Ser Pro Phe Pro Phe Ala Gin Leu Pro Pro Gly 
50 55 60 



Ala Arg Gly Pro Ser Pro Gin Gly Val Trp He Tyr Ser Phe He Arg 
65 70 75 80 



Pro Gly Pro Pro Met Phe Ala Cys Leu Cys Thr Ser Thr Pro Asn Val 
85 90 95 



Ser Ala Leu Pro Pro Glu Ala Leu Cys Arg Ala Ser Leu Phe Trp Glu 
100 105 110 
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Asp Gly Gly Ala Val 





115 


<210> 


111 


<211> 


170 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


111 



Met Tyr Phe Lys Asp Tyr lie Gin Glu Arg Ser Asp Pro Val Glu Gin 
15 10 15 



Gly Lys Pro Val lie Pro Ala Ala Val Leu Gly Arg Leu His Arg Lys 
20 25 30 



Trp Thr Tyr Ser Ala Val Ala Val Ser Pro Gly Ala Ala He Thr Gin 
35 40 45 



He Leu Pro Val He His Gin Leu Asp Trp Arg Leu Mec Glu Phe Lys 
50 55 60 



Leu Ala Asp Pro Asp Glu Val Ala Ala Ser Gly Glu Arg Gly Leu Ala 
65 70 75 80 



His Asp Glu Leu Arg Glu Ala Glu Pro Gly Leu Thr Leu Leu Leu Arg 
65 90 95 



Leu Glu His His Ala Gin Asp Val Gly Glu Ser Ala Thr Cys Ser Arg 
100 105 110 



Leu Asn Val Arg Thr Ser Glu Thr Cys Leu Gly Phe Gin Arg Pro Glu 
115 120 125 



Gly Thr Val Thr Arg He Thr Trp Ala Val Thr Thr Pro Tyr Thr Gly 
130 135 140 



Arg Tyr Leu Thr Phe Arg Pro Gly Thr His Pro Leu Asn Pro Ala Pro 
145 150 155 160 



Gin Gly Phe Val Val Pro Val Gly Cys Pro 
165 170 



<210> 112 

<211> 225 

<212> PRT 

<213> Homo sapien 



<220> 
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<221> MISC^FEATURE 

<222> (41). .(43) 

<223> X=any amino acid 



<220> 

< 2 2 1 > MI SC_FEATURE 

<222> (46). .(48) 

<223> Xsany amino acid 



<220> 

<221> MISC^FEATURB 

<222> (50).. (50) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (57).. (57) 

<223> X=any amino acid 



<220> 

<221> MISC^FEATURE 

<222> (68).. (68) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (95).. (95) 

<223> X*any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (101) . . (101) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (110) . . (110) 

<223> X=any amino acid 



<220> 

<221> MISC^FEATURE 

<222> (II3T. . (113) 

<223> X=any amino acid 



<400> 112 

Lys Asp Phe Asp Ser Pro Glu Asn Gly Ala Asp Ser Phe Gin Ser Ser 
15 10 15 



Asp Ser Leu Leu Gin Ser Trp Asn Ser Gin Ser Ser Leu Leu Asp Val 
20 25 30 
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Gin Arg Val Pro Ser Phe Glu Ser Xaa Xaa Xaa Asp Cys Xaa Xaa Xaa 
35 40 45 



Leu Xaa Leu Aan Lys Pro Thr Cys Xaa Ser Arg lie Thr Ser Lys Arg 
50 55 60 



Gly Val Thr Xaa Trp Ser Lys Ala Asn Gin Leu Tyr Leu Gin Leu Cys 
65 70 75 80 



Trp Pro Ala Ser Gin Glu Val Asp Leu Phe Ser Cys Gly Ser Xaa Ser 
85 90 95 



Trp Ser Cys Tyr Xaa Thr Asn Pro Ala Ser His Ser Ser Xaa Gly Leu 
100 105 110 



Xaa Thr Asp Gly Ser Leu Ser Ser Pro Thr Pro Met Arg Trp Pro Ala 
115 120 ' 125 



Ser Gly Glu Arg Gly Leu Ala His Asp Glu Leu Arg Glu Ala Glu Pro 
130 135 140 



Gly Leu Thr Leu Leu Leu Arg Leu Glu His His Ala Gin Asp Val Gly 
145 150 155 160 



Glu Ser Ala Thr Cys Ser Arg Leu Asn Val Arg Thr Ser Glu Thr Cys 
165 170 175 



Leu Gly Phe Gin Arg Pro Glu Gly Thr Val Thr Arg lie Thr Trp Ala 
180 185 190 



Val Thr Thr Pro Tyr Thr Gly Arg Tyr Leu Thr Phe Arg Pro Gly Thr 
195 200 205 



His Pro Leu Asn Pro Ala Pro Gin Gly Phe Val Val Pro Val Gly Cys 
210 215 220 



Pro 
225 



<210> 113 

<211> 175 

<212> PRT 

<213> Homo sapien 



<400> 113 
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Gly Gly Glu Glu Gly Arg Ala Ser Trp Gly Gin Cys Arg Leu Phe Gly 
15 10 15 



Pro Gly Lys Leu Arg Trp Ala Gly Leu Pro Pro Val Trp Leu Cys Gin 
20 25 30 



Gly His Pro Gly Val Leu His Leu Gly Pro Gly Gly Trp Glu Gly Arg 
35 40 45 



Glu Ala Phe Gly Leu Leu Asn His Leu Glu Val Ser Leu Leu Gin Thr 
50 55 60 



Ser Ala Gly Ser Gly Ser Pro Gly Val Met Gly Ser Gly Trp Leu Asn 
65 70 75 80 



Leu Glu lie Val Trp Ser Leu Phe Glu Gly Pro Ala Trp Leu Leu Leu 
85 90 95 



Gin Arg Asn Cys Arg His Leu Ser Phe Pro Ser Leu Pro His Pro Thr 
100 105 110 



Ala Glu Lys Gly Trp Arg Gly Glu Ser Ser Ser Ala Phe His Ser Val 
115 120 125 



Tyr Val Ser Gly Asp Ser Arg Gly Ala Gly Leu Lys lie Ala Gly Gly 
130 135 140 



Arg Pro Ser Pro Gly Cys Cys Ser Val Gly Ala Trp Pro Ser Ser Ser 
145 150 155 160 



Arg Pro Thr Cys Phe Leu Trp Cys Gly Gin Ser Gin Leu Pro Ser 
165 170 175 



<210> 114 

<211> 270 

<212> PRT 

<213> Homo sapien 

<400> 114 

Met Asp Asp Gin Arg Asp Leu lie Ser Asn Asn Glu Gin Leu Pro Met 
15 10 15 



Leu Gly Arg Arg Pro Gly Ala Pro Glu Ser Lys Cys Ser Arg Gly Ala 
20 25 30 



Leu Tyr Thr Gly Phe Ser He Leu val Thr Leu Leu Leu Ala Gly Gin 
35 40 45 
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Ala Thr Thr Ala Tyr Phe Leu Tyr Gin Gin Gin Gly Arg Leu Asp Lys 
50 55 60 



Leu Thr Val Thr Ser Gin Asn Leu Gin Leu Glu Asn Leu Arg Met Lys 
65 70 75 80 



Leu Pro Lys Pro Pro Lys Pro Val Ser Lys Met Arg Met Ala Thr Pro 
85 90 95 



Leu Leu Met Gin Ala Leu Pro Met Gly Ala Leu Pro Gin Gly Pro Met 
100 105 110 



Gin Asn Ala Thr Lys Tyr Gly Asn Met Thr Glu Asp His Val Met His 
115 120 125 



Leu Leu Gin Asn Ala Asp Pro Leu Lys Val Tyr Pro Pro Leu Lys Gly 
130 135 140 



Ser Phe Pro Glu Asn Leu Arg His Leu Lys Asn Thr Met Glu Thr lie 
145 150 155 160 



Asp Trp Lys Val Phe Glu Ser Trp Met His His Trp Leu Leu Phe Glu 
165 170 175 



Met Ser Arg His Ser Leu Glu Gin Lys Pro Thr Asp Ala Pro Pro Lys 
180 185 190 



Val Leu Thr Lys Cys Gin Glu Glu Val Ser His He Pro Gly Cys Pro 
195 200 205 



Pro Gly Phe He Gin Ala Gin Val Arg Arg Glu Arg Gin Leu Ser Ala 
210 215 220 



Thr Pro Val Leu Trp Gly Ala Ser Ala Thr Ala Gly Val Ser Ser Pro 
225 230 235 240 



Thr Ala Arg Arg Ser Pro Thr Pro Glu Ala Ala Gly Thr He Thr Ala 
245 250 255 



Val Ser His Trp Asn Trp Arg Thr Arg Leu Leu Gly Trp Val 
260 265 270 



<210> 115 
<211> 225 
<212> PRT 
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<213> Homo sapien 
<400> 115 

Gly Arg Thr Gly Asp Ala Val Cys Cys Pro Pro Ala Leu Leu Asp Leu 
15 10 15 



Arg Gly Pro Pro Gly Pro Pro Ser Ala Gly Phe Asp Phe Ser Phe Leu 
20 25 30 



Pro Gin Pro Pro Gin Glu Lys Ala His Asp Gly Gly Arg Tyr Tyr Arg 
35 40 45 



Ala Asp Asp Ala Asn Val Val Arg Asp Arg Asp Leu Glu Val Asp Thr 
50 55 60 



Thr Leu Lys Ser Leu Ser Gin Gin lie Glu Asn lie Arg Ser Pro Glu 
65 70 75 80 



Gly Ser Arg Lys Asn Pro Ala Arg Thr Cys Cys Asp Leu Lys Met Cys 
85 90 95 



Gin Ser Asp Trp Lys Ser Gly Glu Tyr Trp lie Asp Pro Asn Gin Gly 
100 105 110 



Cys Ser Leu Asp Ala lie Lys Val Phe Met Arg Thr Met Glu Thr Gly 
115 120 125 



Glu Thr Leu Arg Val Pro His Ser Ala Ser Val Trp Arg Gin Lys Asn 
130 135 140 



Trp Tyr lie Ser Lys Asn Pro Lys Asp Lys Arg His Val Trp Phe Gly 
145 150 155 160 



Glu Ser Met Thr Asp Gly Phe Gin Phe Glu Tyr Gly Gly Gin Gly Ser 
165 170 175 



Asp Pro Ala Asp Val Ala lie Gin Leu Thr Phe Leu Arg Leu Met Ser 
180 185 190 



Ser Glu Ala Phe Gin Asn lie Thr Tyr His Cys Lys Asn Ser Val Ala 
195 200 205 



Tyr Met Asp His Gin Thr Gly Asn Leu Lys Lys Ala Leu Leu Leu Gin 
210 215 220 



Gly 
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225 



<210> 


116 


<211> 


121 


<212> 


PRT 


<213> 


Homo sapien 


<220> 




<221> 


MISC^FEATURE 


<222> 


(89) . . (89) 


<223> 


X=any amino acid 


<400> 


116 



Trp lie Asp Pro Lys Leu Arg Leu Gin Ala Gly Cys His Pro lie Arg 
15 10 15 



Cys Met Arg Ser Met Glu Thr Gly Glu Thr Leu Arg Val Pro His Ser 
20 25 30 



Ala Ser Val Trp Arg Gin Lys Asn Trp Tyr He Ser Lys Asn Pro Lys 
35 40 45 



Asp Lys Arg His Val Trp Phe Gly Glu Ser Met Thr Asp Gly Phe Gin 
50 55 60 



Phe Glu Tyr Gly Gly Gin Gly Ser Asp Pro Ala Asp Val Ala He Gin 
65 70 75 80 



Leu Thr Phe Leu Arg Leu Met Ser Xaa Glu Ala Phe Gin Asn He Thr 
85 90 95 



Tyr His Cys Lys Asn Ser Val Ala Tyr Met Asp His Gin Thr Gly Asn 
100 105 110 



Leu Lys Lys Ala Leu Leu Leu Gin Gly 
115 120 



<210> 117 

<211> 66 

<212> PRT 

<213> Homo sapien 

<400> 117 

Met Ala Leu Asn Arg Glu Phe Ser Phe He Val He Thr Thr Val Thr 
15 10 15 



Leu Thr Leu Cys Leu Ser Gly Leu Phe Ala He Leu Arg Ser Val Gly 
20 25 30 
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Phe Pro lie Phe Arg Asp lie He Glu Leu Thr He Pro Tyr Tyr Gly 
35 40 45 



Phe He Phe Phe Pro Phe Lys His Ser Lys Glu Thr lie Tyr Tyr Phe 
50 55 60 



Phe Pro 



65 




<210> 


118 


<211> 


81 


<212> 


PRT 


<213> 


Homo sapien 


<220> 




<221> 


MISC FEATURE 


<222> 


(75) . . (75) 


<223> 


Xsany amino acid 


<400> 


118 



Gin Arg Val Leu Phe His Cys Asp His Tyr Arg Asp Thr Tyr Phe Val 
15 10 15 



Pro He Arg Thr Phe Cys Asn He Ala Leu Cys Arg Leu Ser Asn Leu 
20 25 30 



Gin Gly Tyr His Arg Ala Arg Pro Phe Pro Thr Met Asp Leu Phe Phe 
35 40 45 



Phe Leu Ser Asn Thr Val Arg Lys Gin Ser He Thr Phe Phe Leu Lys 
50 55 60 



Arg Arg He Tyr Ser Thr Val He Gin Leu Xaa Asn He Phe Arg Met 
65 70 75 80 



Met 



<210> 119 

<211> 253 

<212> PRT 

<213> Homo sapien 

<400> 119 

Met Val Asp Tyr Tyr Glu Val Leu Gly Val Gin Arg His Ala Leu Tyr 
IS 10 15 
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Pro Arg Asp Ser Tyr Lys Arg His He Gly Lys Leu Ala Leu Lys Trp 
20 25 30 



His Pro Asp Lys Asn Pro Glu Asn Lys Glu Glu Ala Ser Ser Arg Lys 
35 40 45 



Phe Lys Gin Val Ala Glu Ala Tyr Glu Val Leu Ser Asp Ala Lys Lys 
SO 55 60 



Arg Asp He Tyr Asp Lys Tyr Gly Asn Arg Arg He Lys Val Val Glu 
65 70 75 80 



Asp Gly Gly Gly Ser His Phe Asp Ser Pro Phe Glu Phe Gly Phe Thr 
85 90 95 



Phe Arg Asn Pro Asp Asp Val Phe Arg Glu Phe Phe Arg Trp Lys Gly 
100 105 110 



Pro He Leu His Leu Thr Ser Leu Lys Thr Leu Leu Arg Thr Ser Leu 
115 120 125 



Gly He Glu Gly Val Pro Glu Glu Ala Glu Ala Glu Gly Arg Gly Arg 
130 135 140 



Phe Asn Leu Arg Ser Val Asp Phe Arg Leu Leu Glu Ala Gly Trp Ser 
145 150 155 160 



Ser Met Asp Ala Gly Phe Thr Ser Leu Gly Ser Leu Gly His Gly Val 
165 170 175 



Leu Thr Leu Phe Ser Ser Thr Ser Phe Gly Gly Ser Gly Met Gly Asn 
180 185 190 



Tyr Lys Ser He Ser Thr Ser Thr Lys Leu Val Asn Gly Arg Pro He 
195 200 205 



Thr Thr Lys Arg He Val Asp Asn Ser Gin Asp Arg Val Gin Val Glu 
210 215 220 



Asp Asp Gly Gin Leu Lys Phe Leu Thr He Gly Tyr Glu Gin Leu Leu 
225 230 235 240 



Cys Leu Asp Asn Lys Met He Gin Arg Thr Arg Leu Ala 
245 250 
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<210> 120 

<211> 203 

<212> PRT 

<213> Homo sapien 



<220> 

<221> MISC^FEATURE 

<222> (32)., (32) 

<223> Xsany amino acid 



<220> 

<221> MISC_FEATURE 

<222> (1047. . (104) 

<223> X=any amino acid 



<400> 120 

Arg Arg Ser Ser Ser Arg Lys Phe Lys Gin Val Ala Glu Ala Tyr Glu 
15 10 15 



Val Leu Ser Asp Ala Lys Lys Arg Asp lie Tyr Asp Lys Tyr Gly Xaa 
20 25 30 



Arg Arg lie Lys Trp Trp Arg Thr Glu Val Glu Val lie Leu Thr Val 
35 40 45 



His Leu Asn Leu Ala Ser His Ser Val Thr Gin Met Met Ser Ser Gly 
50 55 60 



Asn Phe Leu Gly Gly Arg Asp Pro Phe Ser Phe Asp Phe Phe Glu Asp 
65 70 75 80 



Pro Phe Glu Asp Phe Phe Gly Asn Arg Arg Gly Pro Arg Gly Ser Arg 
85 90 95 



Ser Arg Gly Thr Gly Ser Phe Xaa Ser Ala Phe Ser Gly Phe Pro Ser 
100 105 110 



Phe Val Ser Gly Trp Ser Ser Met Asp Ala Gly Phe Thr Ser Leu Gly 
lis 120 125 



Ser Leu Gly His Gly Val Leu Thr Leu Phe Ser Ser Thr Ser Phe Gly 
130 135 140 



Gly Ser Gly Met Gly Asn Tyr Lys Ser lie Ser Thr Ser Thr Lys Leu 
145 150 155 160 



Val Asn Gly Arg Pro lie Thr Thr Lys Arg lie Val Asp Asn Ser Gin 
165 170 175 
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Asp Arg Val Gin Val Glu Asp Asp Gly Gin Leu Lys Phe Leu Thr lie 
180 185 190 



Gly Tyr Glu Gin Leu Leu Cys Leu Asp Asn Lys 
195 200 



<210> 121 

<211> 128 

<212> PRT 

<213> Homo sapien 

<400> 121 

Met Ala Val Gin lie Ser Lys Lys Arg Lys Phe Val Ala Asp Gly lie 
15 10 15 



Phe Lys Ala Glu Leu Asn Glu Phe Leu Thr Arg Glu Leu Ala Glu Asp 
20 25 30 



Gly Tyr Ser Gly Val Glu Val Arg Val Thr Pro Thr Arg Thr Glu lie 
35 40 45 



lie lie Leu Ala Thr Arg Thr Gin Asn Val Leu Gly Glu Lys Gly Arg 
50 55 60 



Arg lie Arg Glu Leu Thr Ala Val Val Gin Lys Arg Phe Gly Phe Pro 
65 70 75 80 



Glu Gly Ser Val Glu Leu Tyr Ala Glu Lys Val Ala Thr Arg Gly Leu 
85 90 95 



Cys Ala He Ser Pro Gly Arg Val Ser Ala Val Pro Thr Pro Arg Arg 
100 105 110 



Ala Arg Cys Ala Ala Ser Phe Leu Ser Leu Ser Arg Thr Pro Met Gly 

120 125 





115 


<210> 


122 


<211> 


143 


<212> 


PRT 


<213> 


Homo i 


<220> 




<221> 


MISC j 


<222> 


(2) .7 


<223> 


X=any 



<220> 
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<221> MISC_FEATURE 
<222> (115) . . (115) 
<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (119) . . (119) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (121) . . (121) 

<223> X=any amino acid 



<220> 

<22 1 > MISC_FEATURE 

<222> (123) . . (123) 

<223> Xsany amino acid 



<220> 

< 2 2 1 > MI SC_FEATURE 

<222> (125) . . (126) 

<223> Xsany amino acid 



<400> 122 

Lys Xaa Ala Thr Gly Ala Phe Leu Ser Ala Glu Arg Gly Gly Lys Met 
15 10 15 



Ala Val Gin lie Ser Lys Lys Arg Lys Phe Val Ala Asp Gly lie Phe 
20 25 30 



Lys Ala Glu Leu Asn Glu Phe Leu Thr Arg Glu Leu Ala Glu Asp Gly 
35 40 45 



Tyr Ser Gly Val Glu Val Arg Val Thr Pro Thr Arg Thr Glu He He 
50 55 60 



He Leu Ala Thr Arg Thr Gin Asn Val Leu Gly Glu Lys Gly Arg Arg 
65 70 75 80 



He Arg Glu Leu Thr Ala Val Val Gin Lys Arg Phe Gly Phe Pro Glu 
85 90 95 



Gly Ser Val Glu Leu Tyr Ala Glu Lys Val Ala Thr Arg Gly Leu Cys 
100 105 110 



Ala He Xaa Pro Ala Glu Xaa Leu Xaa Tyr Xaa Leu Xaa Xaa Gly Ser 
115 120 125 
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Leu Arg Arg Val Phe Pro He Ala Val Pro His Ala His Gly Ala 
130 135 140 



<210> 123 

<211> 75 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC__FEATURE 

<222> (2).. (2) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (21).. (24) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (26).. (26) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (31).. (31) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (36).. (36) 

<223> X=any amino acid 



<220> 

<221> MISC_FBATURE 

<222> (45)7. (45) 

<223> X=any amino acid 



<400> 123 

His Xaa Leu Gin Lys His Leu Ala Gly Leu Gly Leu Thr Glu Ala He 
1 5 10 15 



Asp Lys Asn Lys Xaa Xaa Xaa Xaa Arg Xaa Ser Gly Lys Lys Xaa Phe 
20 25 30 



Tyr Leu Ala Xaa Phe His Ala Thr Ala Phe Glu Leu Xaa Thr Asp Gly 
35 40 45 
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Asn Pro Phe Asp Gin Asp He Tyr Gly Arg Glu Gly Gly Ala Ala Pro 
50 55 60 



Ser Cys Ser Thr Pro Thr Thr Pro Ser Ser Ser 
70 75 



65 




<210> 


124 


<211> 


110 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


124 



Cys Gly Thr Pro Lys Ala Ala Pro Cys Tyr Ser Leu Gly Ala Trp Ser 
15 10 15 



Gly Leu Arg Val Thr Arg Cys Glu Thr Ser Tyr Arg Ala Ser Gly Cys 
20 25 30 



Thr Gin Asp Gly Arg Arg His Pro Lys Ala Pro Glu Thr His Gly Cys 
35 . 40 45 



Tyr Trp Gly Trp Gly Gly Gly Glu Val Pro Ala Leu Asp Thr Pro Trp 
50 55 60 



Gly Gly Gly Gly Lys Thr Asp Arg Gly Ser Arg Val Pro Glu Arg Thr 
65 70 75 80 



Phe Pro Ala Arg He His Ser Thr Trp Thr Trp Ala Pro Asp Thr Met 
85 90 95 



Met Leu Ser Pro Glu Thr Pro His Pro Val Gly Pro Gly Pro 
100 105 110 



<210> 125 

<211> 196 

<212> PRT 

<213> Homo sapien 

<400> 125 

Met Ser Pro Arg Phe Pro Ala Arg Pro Trp Val Val Lys Leu Val Ala 
1 5 10 15 



Ser Leu His Glu Asp Leu His Glu Val Ser Val Arg Ser Arg Pro Ser 
20 25 30 



Pro Val Pro Thr Pro Gly Trp Leu Gly Glu Gly Val Ala Leu Val Asp 
35 40 45 
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Gly Pro Pro Val Gly Asp Pro Leu Ser Arg Val Pro Glu Pro Cys Arg 
50 55 60 



Val Arg Thr Lys Thr Val Lys Lys Ala Ala Arg Val lie He Glu Lys 
65 70 75 80 



Tyr Tyr Thr Arg Leu Gly Asn Asp Phe His Thr Asn Lys Arg Val Cys 
85 90 95 



Glu Glu He Ala He He Pro Ser Lys Lys Leu Arg Asn Lys He Ala 
100 105 110 



Gly Tyr Val Thr His Leu Met Lys Arg He Gin Arg Gly Pro Val Arg 
115 120 125 



Gly He Ser He Lys Leu Gin Glu Glu Glu Arg Glu Arg Arg Asp Asn 
130 135 140 



Tyr Val Pro Glu Val Ser Ala Leu Asp Gin Glu He He Glu Val Asp 
145 150 155 160 



Pro Asp Thr Lys Glu Met Leu Lys Leu Leu Asp Phe Gly Ser Leu Ser 
165 170 175 



Asn Leu Gin Val Thr Gin Pro Thr Val Gly Met Asn Phe Lys Thr Pro 
180 185 190 



Arg Gly Pro Val 





195 


<210> 


126 


<211> 


207 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


126 



Met Pro Glu His Cys Gly Leu Gly His Arg Arg Cys Ala Cys Gin Gin 
15 10 15 



His Gly Ala Ser Pro Gly Arg Met Thr Phe Glu Gly Asp Thr Asp Val 
20 25 30 



Trp Ala Met Pro Gly Ser Trp Glu Gin Arg Pro Arg Ala Gly Pro Gly 
35 40 45 



Val Arg Ala Ala Arg Ala Gly Gly Phe Trp Glu Pro Lys Ala Arg Leu 
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50 55 60 



Arg Leu Gin Thr Leu Gly Pro Asn Met Gly Arg Val Arg Thr Lys Thr 
65 70 75 80 



Val Lys Lys Ala Ala Arg Val lie lie Glu Lys Tyr Tyr Thr Arg Leu 
85 90 95 



Gly Asn Asp Phe His Thr Asn Lys Arg Val Cys Glu Glu lie Ala lie 
100 105 110 



lie Pro Ser Lys Lys Leu Arg Asn Lys lie Ala Gly Tyr Val Thr His 
115 120 125 



Leu Met Lys Arg lie Gin Arg Gly Pro Val Arg Gly lie Ser He Lys 
130 135 140 



Leu Gin Glu Glu Glu Arg Glu Arg Arg Asp Asn Tyr Val Pro Glu Val 
145 150 155 160 



Ser Ala Leu Asp Gin Glu He lie Glu Val Asp Pro Asp Thr Lys Glu 
165 170 175 



Met Leu Lys Leu Leu Asp Phe Gly Ser Leu Ser Asn Leu Gin Val Thr 
180 185 190 



Gin Pro Thr Val Gly Met Asn Phe Lys Thr Pro Arg Gly Pro Val 
195 200 205 



<210> 


127 




<211> 


180 




<212> 


PRT 




<213> 


Homo 


sapien 


<400> 


127 




Gly Gly His 


Arg Cys 


1 




5 



10 15 



Pro Ser Arg Thr Arg Ser Ala Gly Ser Ala Gly Arg Gly Leu Leu Gly 
20 25 30 



Ala Lys Gly Glu Ala Glu Val Ala Asn Ser Gly Ala Asn Met Gly Arg 
35 40 45 



Val Arg Thr Lys Thr Val Lys Lys Ala Ala Arg Val He He Glu Lys 
50 55 60 
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Tyr Tyr Thr Arg Leu Gly Asn Asp Phe His Thr Asn Lys Arg Val Cys 
65 70 75 80 



Glu Glu lie Ala lie He Pro Ser Lys Lys Leu Arg Asn Lys He Ala 
85 90 95 



Gly Tyr Val Thr His Leu Met Lys Arg He Gin Arg Gly Pro Val Arg 
100 105 110 



Gly He Ser He Lys Leu Gin Glu Glu Glu Arg Glu Arg Arg Asp Asn 
115 120 125 



Tyr Val Pro Glu Val Ser Ala Leu Asp Gin Glu He He Glu Val Asp 
130 135 140 



Pro Asp Thr Lys Glu Met Leu Lys Leu Leu Asp Phe Gly Ser Leu Ser 
145 150 155 160 



Asn Leu Gin Val Thr Gin Pro Thr Val Gly Met Asn Phe Lys Thr Pro 
165 170 175 



Arg Gly Pro Val 





180 


<210> 


128 


<211> 


150 


<212> 


PRT 


<213> 


Homo sap i en 


<400> 


128 



Met Gly Arg Val Arg Thr Lys Thr Val Lys Lys Ala Ala Arg Val He 
15 10 15 



He Glu Lys Tyr Tyr Thr Arg Leu Gly Asn Asp Phe His Thr Asn Lys 
20 25 30 



Arg Val Cys Glu Glu He Ala He He Pro Ser Lys Lys Leu Arg Asn 
35 40 45 



Lys He Ala Gly Tyr Val Thr His Leu Met Lys Arg He Gin Arg Gly 
50 55 60 



Pro Val Arg Gly He Ser He Lys Leu Gin Glu Glu Glu Arg Glu Arg 
65 70 75 80 



Arg Asp Asn Tyr Val Pro Glu Val Ser Ala Leu Asp Gin Glu He He 
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85 90 95 



Glu Val Asp Pro Asp Thr Lys Glu Met Leu Lys Leu Leu Asp Phe Gly 
100 105 110 



Ser Leu Ser Asn Leu Gin Val He His Pro Asn Cys Arg Leu Ser Asp 
115 120 125 



Leu Lys Val Gly Gin Thr Ala Gin Pro Thr Val Gly Met Asn Phe Lys 
130 135 140 



Thr Pro Arg Gly Pro Val 
150 



145 




<210> 


129 


<211> 


298 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


129 



Met Arg Leu Ala Ala Leu Ala Val Ser Ala Cys He Leu Phe Arg Glu 
15 10 15 



Ala Leu Leu Arg Pro Trp Thr Gly Pro Pro Glu Arg Met Pro Val Arg 
20 25 30 



Ala Ala Arg Gly Glu Gly Pro Val Ala Met Gly Arg Val He Arg Gly 
35 40 45 



Gin Arg Lys Gly Ala Gly Ser Val Phe Arg Ala His Val Lys His Arg 
50 55 60 



Lys Gly Ala Ala Arg Leu Arg Ala Val Asp Phe Ala Glu Arg His Gly 
65 70 75 80 



Tyr He Lys Gly He Val Lys Asp He He His Asp Pro Gly Arg Gly 
85 90 95 



Ala Pro Leu Ala Lys Val Val Phe Arg Asp Pro Tyr Arg Phe Lys Lys 
100 105 110 



Arg Thr Glu Leu Phe He Ala Ala Glu Gly He His Thr Gly Gin Phe 
115 120 125 



Val Tyr Cys Gly Lys Lys Ala Gin Leu Asn He Gly Asn Val Leu Pro 
130 135 140 
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yal Gly Thr Met Pro Glu Gly Thr He Val Cys Cys Leu Glu Glu Lys 
150 

Pro Gly Asp Arg Gly Lys Leu Ala Arg Ala Ser Gly Asn Tyr Ala Thr 
165 170 175 

Val He Ser His Asn Pro Glu Thr Lys Lys Thr Arg Val Lys Leu Pro 
180 185 190 

Ser Gly Ser Lys Lys Val He Ser Ser Ala Asn Arg Ala Val Val Gly 
155 200 205 

val yal Ala Gly Gly Gly Arg He Asp Lys Pro He Leu Lys Ala Gly 
210 215 220 



Arg Ala Tyr His Lys Tyr Lys Ala Lys Arg Asn Cys Trp Pro Arg Val 
230 235 ^ 240 

Arg Gly Val Ala Met Asn Pro Val Glu His Pro Phe Gly Gly Gly Asn 
245 250 255 

His Gin His He Gly Lys Pro Ser Thr He Arg Arg Asp Ala Pro Ala 
260 265 270 

Gly Arg Lys Val Gly Leu He Ala Ala Arg Arg Thr Gly Arg Leu Arg 
275 280 285 



Gly Thr Lys Thr Val Gin Glu Lys Glu Asn 
290 295 



<210> 130 

<2I1> 271 

<212> PRT 

<213> Homo sapien 



<220> 

<221> MISC FEATURE 

<222> {1).T(2) 

<223> Xsany amino acid 



<400> 130 



Xaa Xaa Ala Gly Ala Gly Ala Arg Gly Glu Gly Pro Val Ala Met 



1 5 



10 15 



Gly 



Arg Val He Arg Gly Gin Arg Lys Gly Ala Gly Ser Val Phe Arg Ala 
2° 25 30 
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His Val Lys His Arg Lys Gly Ala Ala Arg Leu Arg Ala Val Asp Phe 
35 40 45 



Ala Glu Arg His Gly Tyr lie Lys Gly lie Val Lys Asp He He His 
50 55 60 



Asp Pro Gly Arg Gly Ala Pro Leu Ala Lys Val Val Phe Arg Asp Pro 
65 70 75 80 



Tyr Arg Phe Lys Lys Arg Thr Glu Leu Phe He Ala Ala Glu Gly He 
85 90 95 



His Thr Gly Gin Phe Val Tyr Cys Gly Lys Lys Ala Gin Leu Asn He 
100 105 110 



Gly Asn Val Leu Pro Val Gly Thr Met Pro Glu Gly Thr He Val Cys 
115 120 125 



Cys Leu Glu Glu Lys Pro Gly Asp Arg Gly Lys Leu Ala Arg Ala Ser 
130 135 140 



Gly Asn Tyr Ala Thr Val He Ser His Asn Pro Glu Thr Lys Lys Thr 
145 150 155 160 



Arg Val Lys Leu Pro Ser Gly Ser Lys Lys Val He Ser Ser Ala Asn 
165 170 175 



Arg Ala Val Val Gly Val Val Ala Gly Gly Gly Arg He Asp Lys Pro 
180 185 190 



He Leu Lys Ala Gly Arg Ala Tyr His Lys Tyr Lys Ala Lys Arg Asn 
195 200 205 



Cys Trp Pro Arg Val Arg Gly Val Ala Met Asn Pro Val Glu His Pro 
210 215 220 



Phe Gly Gly Gly Asn His Gin His He Gly Lys Pro Ser Thr He Arg 
225 230 235 240 



Arg Asp Ala Pro Ala Gly Arg Lys Val Gly Leu He Ala Ala Arg Arg 
245 250 255 



Thr Gly Arg Leu Arg Gly Thr Lys Thr Val Gin Glu Lys Glu Asn 
260 265 270 
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<210> 131 
<211> 550 
<212> PRT 
<213> Homo sapien 

<400> 131 

Met Met Lys Ala Ala Gly Lys Gin Gin Arg Val Gin Gin Gin His Ser 
15 10 15 



Ser Ala Gin His Gin Gin His Ala Cys Thr Ala Asn Ser Pro Lys His 
20 25 30 



Arg Lys His Val Gly Ser Ser Met Gin Ala Gly Met His Ser Arg Ser 
35 40 45 



Gin Ala Ser Ser Thr Ala Gin Gin Gin Leu Lys His Ser lie Gin Gin 
50 55 60 



Gin Gin lie Pro Leu His Pro Gly Thr Ala Thr Gin Thr Ser Thr Lys 
65 70 75 80 



Pro lie Trp Thr Arg Asn Pro Asp Asp lie Thr Gin Glu Glu Tyr Gly 
85 90 95 



Glu Phe Tyr Lys Ser Leu Thr Asn Asp Trp Glu Asp His Leu Ala Val 
100 105 110 



Lys His Phe Ser Val Glu Gly Gin Leu Glu Phe Arg Ala Leu Leu Phe 
115 120 125 



lie Pro Arg Arg Ala Pro Phe Asp Leu Cys Glu Asn Lys Lys Lys Lys 
130 135 140 



Asn Asn lie Lys Leu Tyr Val Arg Arg Val Phe lie Met Asp Ser Cys 
145 150 155 160 



Asp Glu Leu He Pro Glu Tyr Leu Asn Phe lie Arg Gly Val Val Asp 
165 170 175 



Ser Glu Asp Leu Pro Leu Asn He Ser Arg Glu Met Leu Gin Gin Ser 
180 185 190 



Lys He Leu Lys Val His Ser Gin Gin Thr Leu Leu Arg Ser Ala Leu 
195 200 205 



Ser Ser Ser Leu Glu Leu Ala Glu Asp Lys Ala Glu Leu Gin Asp Asn 
210 215 220 
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Ser Tyr Glu Gly Thr Ser His Lys Asn Leu Asn Ala Trp Asn Pro Arg 
225 230 235 240 



Arg His Pro Leu Thr Gly Ala Ala Cys Leu Glu Leu Leu Arg Tyr His 
245 250 255 



Thr Ser Gin Ser Gly Asp Glu Met Thr Ser Leu Ser Glu Tyr Val Ser 
260 265 270 



Arg Met Lys Glu Thr Gin Lys Ser lie Tyr Tyr lie Thr Gly Glu Ser 
275 280 285 



Lys Glu Gin Val Ala Asn Ser Ala Phe Val Glu Arg Val Arg Lys Arg 
290 295 300 



Gly Phe Glu Val Val Tyr Met Thr Glu Pro lie Asp Glu Tyr Cys Val 
305 310 315 320 



Gin Gin Leu Lys Glu Phe Asp Gly Lys Ser Leu Val Ser Val Thr Lys 
325 330 335 



Glu Gly Leu Glu Leu Pro Glu Asp Glu Glu Glu Lys Lys Lys Met Glu 
340 345 350 



Glu Ser Lys Ala Lys Phe Glu Asn Leu Cys Lys Leu Met Lys Glu lie 
355 360 365 



Leu Asp Lys Lys Val Glu Lys Val Thr lie Ser Asn Arg Leu Val Ser 
370 375 380 



Ser Pro Cys Cys lie Val Thr Ser Thr Tyr Gly Trp Thr Ala Asn Met 
385 390 395 400 



Glu Arg lie Met Lys Ala Gin Ala Leu Arg Asp Asn Ser Thr Met Gly 
405 410 415 



Tyr Met Met Ala Lys Lys His Leu Glu lie Asn Pro Asp His Pro lie 
420 425 430 



Val Glu Thr Leu Arg Gin Lys Ala Glu Ala Asp Glu Asn Asp Lys Ala 
435 440 445 



Val Lys Asp Leu Val Val Leu Leu Phe Glu Thr Ala Leu Val Ser Ser 
450 455 460 
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Gly Phe Ser Leu Glu Asp Pro Gin Thr Gin Ser Asn Arg He Tyr Arg 
465 470 475 480 



Met He Lys Leu Gly Leu Gly He Asp Glu Asp Glu Val Ala Ala Glu 
485 490 495 



Glu Pro Asn Ala Ala Val Pro Asp Glu He Pro Pro Leu Glu Gly Asp 
500 505 510 



Glu Asp Ala Ser Arg Met Arg Gly Ser Arg Val Arg Leu Gly Val Val 
515 520 525 



Leu Gly Asn Thr Cys Ala Phe Gly Phe Cys Val Pro His Gly Ala Pro 
530 535 540 



Thr Ala Pro Arg Val Pro 



545 


550 


<210> 


132 


<211> 


190 


<212> 


PRT 


<213> 


Homo sapien 


<220> 




<221> 


MISC FEATURE 


<222> 


(181) . . (181) 


<223> 


X=any amino acid 



<400> 132 

Glu Leu Leu Arg Tyr His Thr Ser Gin Ser Gly Asp Glu Met Thr Ser 
15 10 15 



Leu Ser Glu Tyr Val Ser Arg Met Lys Glu Thr Gin Lys Ser He Tyr 
20 25 30 



Tyr He Thr Gly Glu Ser Lys Glu Gin Val Ala Asn Ser Ala Phe Val 
35 40 45 



Glu Arg Val Arg Lys Arg Gly Phe Glu Val Val Tyr Met Thr Glu Pro 
50 55 60 



He Asp Glu Tyr Cys Val Gin Gin Leu Lys Glu Phe Asp Gly Lys Ser 
65 70 75 80 



Leu Val Ser Val Thr Lys Glu Gly Leu Glu Leu Pro Glu Asp Glu Glu 
85 90 95 
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Glu Lys Lys Lys Met Glu Glu Ser Lys Ala Lys Phe Glu Asn Leu Cys 
100 105 110 



Lys Leu Met Lys Glu He Leu Asp Lys Lys Val Glu Lys Val Thr He 
115 120 125 



Ser Asn Arg Leu Val Ser Ser Pro Cys Cys lie Val Thr Ser Thr Tyr 
130 135 140 



Gly Trp Thr Ala Asn Met Glu Arg He Met Lys Ala Gin Ala Leu Arg 
145 150 155 160 



Asp Asn Ser Thr Met Gly Tyr Met Met Ala Lys Lys His Leu Glu He 
165 170 175 



Asn Pro Asp His Xaa His Cys Gly Asp Ala Ala Ala Glu Gly 
180 185 190 



<210> 133 

<211> 111 

<212> PRT 

<213> Homo sapien 

<400> 133 

Met Gly Val Asp He Arg His Asn Lys Asp Arg Lys Val Arg Arg Lys 
15 10 IS 



Glu Pro Lys Ser Gin Asp He Tyr Leu Arg Leu Leu Val Lys Leu Tyr 
20 25 30 



Arg Phe Leu Ala Arg Arg Thr Asn Ser Thr Phe Asn Gin Val Val Leu 
35 40 45 



Lys Arg Leu Phe Met Ser Arg Thr Asn Arg Pro Pro Leu Ser Leu Ser 
50 55 60 



Arg Met He Arg Lys Met Lys Leu Pro Gly Arg Glu Asn Lys Thr Ala 
65 70 75 80 



Val Val Val Gly Thr He Thr Asp Asp Val Arg Val Gin Glu Val Pro 
85 90 95 



Arg Arg Asp His Ala Ser He Thr Leu Arg Arg Ser Thr Cys He 
100 105 110 



<210> 134 
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<211> 261 
<212> PRT 
<213> Homo sapien 

<400> 134 

Phe Pro Arg Glu Ser Gly Pro Arg Pro Val Pro Arg Thr Asp Ser Gly 
15 10 15 



Ala Ser Val Gly Ala Gly Cys Leu Arg Thr Leu Ala Val Gly Pro Gly 
20 25 30 



Gin Glu Gly Ala Gly Gly Arg Asp Ser Gly Cys Thr Val He Trp Arg 
35 40 45 



Ser Ala Ala Gly Pro Thr Gly He Arg Gly Phe Gly Gly Ala Arg Arg 
50 55 60 



Pro Gly Ser Glu Leu Gly Ser Cys Cys Ala Ala His Val Leu Thr Ser 
65 70 75 80 



Ala Ser Asp Val Trp Ser Tyr Gly He Val Met Trp Glu Val Met Ser 
85 90 95 



Tyr Gly Glu Arg Pro Tyr Trp Asp Met Ser Asn Gin Asp Val He Asn 
100 105 110 



Ala Val Glu Gin Asp Tyr Arg Leu Pro Pro Pro Met Asp Cys Pro Thr 
115 120 125 



Ala Leu His Gin Leu Met Leu Asp Cys Trp Val Arg Asp Arg Asn Leu 
130 135 140 



Arg Pro Lys Phe Ser Gin He Val Asn Thr Leu Asp Lys Leu He Arg 
145 150 155 160 



Asn Ala Ala Ser Leu Lys Val He Ala Ser Ala Gin Ser Gly Met Ser 
165 170 175 



Gin Pro Leu Leu Asp Arg Thr Val Pro Asp Tyr Thr Thr Phe Thr Thr 
180 185 190 



Val Gly Asp Trp Leu Asp Ala He Lys Met Gly Arg Tyr Lys Glu Ser 
195 200 205 



Phe Val Ser Ala Gly Phe Ala Ser Phe Asp Leu Val Ala Gin Met Thr 
210 215 220 
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Ala Glu Asp Leu Leu Arg lie Gly Val Thr Leu Ala Gly His Gin Lys 
225 230 235 240 



Lys He Leu Ser Ser He Gin Asp Met Arg Leu Gin Met Asn Gin Thr 
245 250 255 



Leu Pro Val Gin Val 





260 


<210> 


135 


<211> 


361 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


135 



Met Pro Gly Val Cys Asp Arg Ala Pro Asp Phe Leu Ser Pro Ser Glu 
15 10 15 



Asp Gin Val Leu Arg Pro Ala Leu Gly Ser Ser Val Ala Leu Asn Cys 
20 25 30 



Thr Ala Trp Val Val Ser Gly Pro His Cys Ser Leu Pro Ser Val Gin 
35 40 45 



Trp Leu Lys Asp Gly Leu Pro Leu Gly He Gly Gly His Tyr Ser Leu 
50 55 60 



His Glu Tyr Ser Trp Val Lys Ala Asn Leu Ser Glu Val Leu Val Ser 
65 70 75 80 



Ser Val Leu Gly Val Asn Val Thr Ser Thr Glu Val Tyr Gly Ala Phe 
85 90 95 



Thr Cys Ser He Gin Asn He Ser Phe Ser Ser Phe Thr Leu Gin Arg 
100 105 110 



Ala Gly Pro Thr Ser His Val Ala Ala Val Leu Ala Ser Leu Leu Val 
115 120 125 



Leu Leu Ala Leu Leu Leu Ala Ala Leu Leu Tyr Val Lys Cys Arg Leu 
130 135 140 



Asn Val Leu Leu Trp Tyr Gin Asp Ala Tyr Gly Glu Val Glu He Asn 
145 150 155 160 



Asp Gly Lys Leu Tyr Asp Ala Tyr Val Ser Tyr Ser Asp Cys Pro Glu 
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165 170 175 



Asp Arg Lys Phe Val Asn Phe He Leu Lys Pro Gin Leu Glu Arg Arg 
180 185 190 



Arg Gly Tyr Lys Leu Phe Leu Asp Asp Arg Asp Leu Leu Pro Arg Ala 
195 200 205 



Glu Pro Ser Ala Asp Leu Leu Val Asn Leu Ser Arg Cys Arg Arg Leu 
210 215 220 



He Val Val Leu Ser Asp Ala Phe Leu Ser Arg Ala Trp Cys Ser His 
225 230 235 240 



Ser Phe Arg Tro Val Pro Arg Gly Val Gly Trp Ala Pro Ala Tyr Thr 
245 250 255 



His Pro Pro Asp Gly Pro Ala Pro Gin Gly Gly Pro Val Pro Ala Ala 
260 265 270 



Gly Ala His Pro Gin Thr His Leu His His Leu Arg Gly Pro Glu Ala 
275 280 285 



Arg Pro Arg Ala Pro Gly Ala Pro Pro Ala Ala Pro Ala Pro Pro Pro 
290 295 300 



Gly Asp Leu Ala Ala Leu Glu Ala Arg Leu Arg Asp Ser Phe Leu Arg 
305 310 315 320 



Phe Leu Glu Arg Ser Ala Ala Gly Ala Ala Ala Glu Gly Ala Val Gin 
325 330 335 



Ala Gly Gly Arg Arg Pro Pro Asp Ala Ala Ala Gly Arg Gin Gly Pro 
340 345 350 



His Ala Asp Ser Ser Arg Pro Ser Pro 

360 





355 


<210> 


136 


<211> 


329 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


136 



Met Pro Gly Val Cys Asp Arg Ala Pro Asp Phe Leu Ser Pro Ser Glu 
15 10 15 
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Asp Gin Val Leu Arg Pro Ala Leu Gly Ser Ser Val Ala Leu Asn Cys 
20 25 30 



Thr Ala Trp Val Val Ser Gly Pro His Cys Ser Leu Pro Ser Val Gin 
35 40 45 



Trp Leu Lys Asp Gly Leu Pro Leu Gly lie Gly Gly His Tyr Ser Leu 
50 55 60 



His Glu Tyr Ser Trp Val Lys Ala Asn Leu Ser Glu Val Leu Val Ser 
65 70 75 80 



Ser Val Leu Gly Val Asn Val Thr Ser Thr Glu Val Tyr Gly Ala Phe 
85 90 95 



Thr Cys Ser He Gin Asn He Ser Phe Ser Ser Phe Thr Leu Gin Arg 
100 105 110 



Ala Gly Pro Thr Ser His Val Ala Ala Val Leu Ala Ser Leu Leu Val 
115 120 125 



Leu Leu Ala Leu Leu Leu Ala Ala Leu Leu Tyr Val Lys Cys Arg Leu 
130 135 140 

Asn Val Leu Leu Trp Tyr Gin Asp Ala Tyr Gly Glu Val Glu He Asn 

145 150 155 160 



Asp Gly Lys Leu Tyr Asp Ala Tyr Val Ser Tyr Ser Asp Cys Pro Glu 
165 170 175 



Asp Arg Lys Phe Val Asn Phe He Leu Lys Pro Gin Leu Glu Arg Arg 
180 185 190 



Arg Gly Tyr Lys Leu Phe Leu Asp Asp Arg Asp Leu Leu Pro Arg Ala 
195 200 205 



Glu Pro Ser Ala Asp Leu Leu Val Asn Leu Ser Arg Cys Arg Arg Leu 
210 215 220 



He Val Val Leu Ser Asp Ala Phe Leu Ser Arg Ala Trp Cys Ser His 
225 230 235 240 



Ser Phe Arg Trp Val Pro Arg Gly Val Gly Trp Ala Pro Ala Tyr Thr 
245 250 255 
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His Pro Pro Asp Gly Pro Ala Pro Gin Gly Gly Pro Val Pro Ala Ala 
260 265 270 



Gly Ala His Pro Gin Thr His Leu His His Leu Arg Gly Pro Glu Ala 
275 280 285 



Arg Pro Arg Ala Pro Gly Ala Pro Pro Ala Ala Pro Ala Pro Pro Pro 
290 295 300 



Gly Asp Leu Ala Ala Leu Glu Ala Arg Leu Arg Gly Ala Glu Gin Ala 
305 310 315 320 



Arg Glu Gly Pro Gly Leu Ala Ala Gly 
325 



<210> 137 

<211> 164 

<212> PRT 

<213> Homo sapien 

<400> 137 

Pro Pro Pro Leu Arg Arg Arg Arg Pro Pro Ser Arg Arg Ala Leu Arg 
15 10 15 



Arg Pro Leu Gly Glu Pro Glu Pro Leu Pro Thr Pro His Arg Gly Ala 
20 25 30 



Phe Gly Arg Leu Pro Glu Pro Gly Leu Val Gin Pro Gin Leu Pro Thr 
35 40 45 



Pro Ser Ser Asp Phe Trp Lys Glu Val Gin Leu Ala Leu Pro Arg Lys 
50 55 60 



Val Arg Tyr Arg Pro Val Glu Gly Asp Pro Gin Thr Gin Leu Gin Asp 
65 70 75 80 



Asp Lys Asp Pro Met Leu He Leu Arg Gly Arg Val Pro Glu Gly Arg 
65 90 95 



Ala Leu Asp Ser Glu Val Asp Pro Asp Pro Glu Gly Asp Leu Gly Val 
100 105 110 



Arg Gly Pro Val Phe Gly Glu Pro Ser Ala Pro Pro His Thr Ser Gly 
115 120 125 



Val Ser Leu Gly Glu Ser Arg Ser Ser Glu Val Asp Val Ser Asp Leu 
130 135 140 
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Gly Ser Arg Asn Tyr Ser Ala Arg Thr Asp Phe Tyr Cys Leu Val Ser 
145 150 155 160 



Lys Asp Asp Met 



<210> 138 

<211> 66 

<212> PRT 

<213> Homo sapien 

<400> 138 

Met Leu Leu Glu Arg Arg Ser Val Met Asp Arg Gly Arg Gly Glu Glu 
15 10 15 



Trp Arg Ala Arg Ser Glu Ser Ala Gin Ser Lys Met Leu Ser Gly Val 
20 25 30 



Gly Gly Phe Val Leu Gly Leu Leu Phe Leu Gly Ala Gly Leu Phe He 
35 40 45 



Tyr Phe Arg Asn Gin Lys Gly His Ser Gly Leu Gin Pro Thr Gly Phe 
50 55 60 



Leu Ser 
65 



<210> 


139 




<211> 


135 




<212> 


PRT 




<213> 


Homo 


sapien 


<400> 


139 




Pro His Ser 


Arg Lys 


1 




5 



10 15 



Phe Pro Ala Trp Gin Leu Phe Phe His Lys Arg Gly Leu Ser Gin Asp 
20 25 30 



Leu Val Ala Thr Gly Ser Ala Thr Ala Glu Asn Val Leu Pro Cys Gly 
35 40 45 



Phe Leu Ser Ser Cys Pro Trp Pro Glu Val Pro Ala Leu Met Ala Ala 
50 55 60 



Pro His Leu Gin Leu Leu Cys Ser Pro Leu Pro Lys Pro Tyr Gly Leu 
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65 70 75 80 



Pro Cys lie Cys Thr His Pro Val Arg Gin Thr His Tyr He He Lys 
85 90 95 



Cys Phe Ser Lys Met Glu Leu Asn He He Trp Ser He Trp Leu Gin 
100 105 110 



Arg Gin Lys Met Lys Arg Lys Arg Glu Asp Tyr Phe Pro Asn Arg He 
115 120 125 



Met He Phe Met Tyr Met Ser 
130 135 



<210> 140 

<211> 115 

<212> PRT 

<213> Homo sapien 

<400> 140 

Met Lys Ser Phe Pro Ala Trp Gin Leu Phe Phe His Lys Arg Gly Leu 
15 10 15 



Ser Gin Asp Leu Val Ala Thr Gly Ser Ala Thr Leu Gin Lys Met Ser 
20 25 30 



He Pro Cys Gly Phe Leu Ser Ser Cys Pro Trp Pro Glu Val Pro Ala 
35 40 45 



Leu Met Ala Ala Pro His Leu Gin Leu Leu Cys Ser Pro Leu Pro Lys 
50 55 60 



Pro Tyr Gly Leu Pro Cys He Cys Thr His Pro Val Arg Gin Thr His 
65 70 75 80 



Tyr He He Lys Cys Phe Ser Lys Met Glu Leu Asn He He Trp Ser 
85 90 95 



He Trp Leu Gin Arg Gin Lys Met Lys Arg Lys Arg Glu Asp Tyr Phe 
100 105 110 



Pro He Glu 
115 



<210> 141 
<211> 135 
<212> PRT 
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<213> Homo sapien 
<400> 141 

Pro His Ser Arg Lys Asn Leu Leu Pro Gin Leu Cys Arg Met Lys Ser 
15 10 15 

Phe Pro Ala Trp Gin Leu Phe Phe His Lys Arg Gly Leu Ser Gin Asp 
20 25 30 

Leu Val Ala Thr Gly Ser Ala Thr Ala Glu Asn Val Leu Pro Cys Gly 
35 40 45 

Phe Leu Ser Ser Cys Pro Trp Pro Glu Val Pro Ala Leu Met Ala Ala 
50 55 60 

Pro His Leu Gin Leu Leu Cys Ser Pro Leu Pro Lys Pro Tyr Gly Leu 
65 70 75 80 

Pro Cys He Cys Thr His Pro Val Arg Gin Thr His Tyr He He Lys 
85 90 95 

Cys Phe Ser Lys Met Glu Leu Asn He He Trp Ser He Trp Leu Gin 
100 105 110 

Arg Gin Lys Met Lys Arg Lys Arg Glu Asp Tyr Phe Pro Asn Arg He 
115 120 125 



Met He Phe Met Tyr Met Ser 
130 135 



<210> 142 

<211> 220 

<212> PRT 

<213> Homo sapien 

<400> 142 

Met Asp Gin His Phe Arg Thr Thr Pro Leu Glu Lys Asn Ala Pro Val 
15 10 15 



Leu Leu Ala Leu Leu Gly He Trp Tyr He Asn Cys Phe Gly Cys Glu 
20 25 30 



Thr His Ala Met Leu Pro Tyr Asp Gin Tyr Leu His Arg Phe Ala Ala 
35 40 45 



Tyr Phe Gin Gin Gly Asp Met Glu Ser Asn Gly Lys Tyr He Thr Lys 
50 55 60 
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Ser Gly Thr Arg Val Asp His Gin Thr Gly Pro He Val Trp Gly Glu 
65 70 75 80 



Pro Gly Thr Asn Gly Gin His Ala Phe Tyr Gin Leu He His Gin Gly 
85 90 95 



Thr Lys Met He Pro Cys Asp Phe Leu He Pro Val Gin Thr Gin His 
100 105 110 



Pro He Arg Lys Gly Leu His His Lys He Leu Leu Ala Asn Phe Leu 
115 120 125 



Ala Gin Thr Glu Ala Leu Met Arg Gly Lys Ser Thr Glu Glu Ala Arg 
130 135 140 



Lys Glu Leu Gin Ala Ala Gly Lys Ser Pro Glu Asp Leu Glu Arg Leu 
145 150 155 160 



Leu Pro His Lys Val Phe Glu Gly Asn Arg Pro Thr Asn Ser He Val 
165 170 175 



Phe Thr Lys Leu Thr Pro Phe Met Leu Gly Ala Leu Val Ala Met Tyr 
180 185 190 



Glu His Lys He Phe Val Gin Gly He He Trp Asp He Asn Ser Phe 
195 200 205 



Asp Gin Trp Gly Ser Gly Ala Gly Lys Ala Ala Gly 
210 215 220 



<210> 


143 


<211> 


287 


<212> 


PRT 


<213> 


Homo ! 


<220> 




<221> 


MISC ] 


<222> 


(7) . . 


<223> 


X=any 


<400> 


143 



Val Arg Gly Leu Gly Gly Xaa Ala He Gly Leu Ser He Ala Leu His 
15 10 15 



Val Gly Phe Asp Asn Phe Glu Gin Leu Leu Ser Gly Ala His Trp Met 
20 25 30 
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Asp Gin His Phe Arg Thr Thr Pro Leu Glu Lys Asn Ala Pro Val Leu 
35 40 45 



Leu Ala Leu Leu Gly He Trp Tyr He Asn Cys Phe Gly Cys Glu Thr 
50 55 60 



His Ala Met Leu Pro Tyr Asp Gin Tyr Leu His Arg Phe Ala Ala Tyr 
65 70 75 80 



Phe Gin Gin Gly Asp Met Glu Ser Asn Gly Lys Tyr He Thr Lys Ser 
85 90 95 



Gly Thr Arg Val Asp His Gin Thr Gly Pro He Val Trp Gly Glu Pro 
100 105 110 



Gly Thr Asn Gly Gin His Ala Phe Tyr Gin Leu He His Gin Gly Thr 
115 120 125 



Lys Met He Pro Cys Asp Phe Leu He Pro Val Gin Thr Gin His Pro 
130 135 140 



He Arg Lys Gly Leu His His Lys He Leu Leu Ala Asn Phe Leu Ala 
145 150 155 160 



Gin Thr Glu Ala Leu Met Arg Gly Lys Ser Thr Glu Glu Ala Arg Lys 
165 170 175 



Glu Leu Gin Ala Ala Gly Lys Ser Pro Glu Asp Leu Glu Arg Leu Leu 
180 185 190 



Pro His Lys Val Phe Glu Gly Asn Arg Pro Thr Asn Ser He Val Phe 
195 200 205 



Thr Lys Leu Thr Pro Phe Met Leu Gly Ala Leu Val Ala Met Tyr Glu 
210 215 220 



His Lys He Phe Val Gin Gly He He Trp Asp He Asn Ser Phe Asp 
225 230 235 240 



Gin Trp Gly Val Glu Leu Gly Lys Gin Leu Ala Lys Lys He Glu Pro 
245 250 255 



Glu Leu Asp Gly Ser Ala Gin Val Thr Ser His Asp Ala Ser Thr Asn 
260 265 270 
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Gly Leu He Asn Phe He Lys Gin Gin Arg Glu Ala Arg Val Gin 
275 280 285 



<210> 144 

<211> 147 

<212> PRT 

<213> Homo sapien 

<400> 144 

Met Ala Pro Gly Arg Gly Leu Gly His Ala Trp Leu Val Leu Gin Asn 
15 10 15 



Gly Arg Ala Cys Pro His Arg Pro Ala Arg Leu Ser Leu Trp Gly Arg 
20 25 30 



Val Cys Phe Pro Ser Arg Gly Leu Gly He Arg Thr Leu Leu Glu Thr 
35 40 45 



Phe Leu Gly Val Phe Cys Arg Tyr Leu Lys Glu He Ala Gin Pro Thr 
50 55 60 



Leu Leu Cys Ser Pro Ser Ser His His Ser Cys Leu Glu Pro Trp Ser 
65 70 75 80 



Pro Cys Met Ser Thr Arg Ser Ser Phe Arg Ala Ser Ser Gly Thr Ser 
85 90 95 



Thr Ala Leu Thr Ser Gly Gly Val Glu Leu Gly Lys Gin Leu Ala Lys 
100 105 110 



Lys He Glu Pro Glu Leu Asp Gly Ser Ala Gin Val Thr Ser His Asp 
115 120 125 



Ala Ser Thr Asn Gly Leu He Asn Phe He Lys Gin Gin Arg Glu Ala 
130 135 140 



Arg Val Gin 



145 




<210> 


145 


<211> 


150 


<212> 


PRT 


<213> 


Homo sapien 


<220> 




<221> 


MISC FEATURE 


<222> 


<9) . . (10) 


<223> 


Xsany amino acid 
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<400> 145 

Ser Gin His Phe Gly Arg Pro Arg Xaa Xaa Asp His Leu Arg Ser Asp 
15 10 15 



Gin Ser Gly Gin His Gly Glu Thr Pro Ser Val Pro Lys lie Gin Lys 
20 25 30 



Pro Ala Gly His Gly Gly Thr Cys Leu Trp Ser Gin Leu Leu Gly Arg 
35 40 45 



Pro Arg Gin Lys Thr Arg Trp Asn Pro Gly Gly Gly Ala Cys Arg Glu 
50 55 60 



Pro Arg Leu Cys His Cys Thr Ala Ala Trp Val Thr Glu Pro Asp Ser 
65 70 75 80 



lie Ser Thr Thr Asp Ala Leu Thr Leu Gly Val Ser Val Ala Gin Gly 
85 90 95 



Arg Gly Ala His Val Thr Gin Ala Asp Gly Pro Phe Ala Thr Ala Val 
100 105 110 



Asp Glu His Val Ala Leu Val Arg Val Glu Leu Gly Cys Ser Asp Asp 
115 120 125 



Phe Gly Gin Leu Leu His Val Ser Arg Leu Asp Val His Asp Val Lys 
130 135 140 



Ala Ser lie Cys Asp Phe 
145 150 



<210> 146 

<211> 811 

<212> PRT 

<213> Homo sapien 

<400> 146 

Met Thr Asp lie Leu Phe Leu Pro Met Trp lie Ser Asn Gin His Thr 
15 10 15 



Pro Ser Ser Pro Gin Gly Asp Gly Gly Ser Ala His Thr Phe lie Ser 
20 25 30 



Thr Gly Gly Pro Gly He Ser Thr Arg Leu His Leu His Arg Gly Met 
35 40 45 
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Gly Asp Gin His Thr Pro Ser Ser Pro Gin Trp Asp Gly Gly Ser Ala 
50 55 60 



His Ala Phe lie Ser Thr Gly Gly Trp Gly Met Ser Thr Arg Leu His 
65 70 75 80 



Leu His Arg Gly Met Ala Asp Gin His Thr Pro Ser Ser Pro Gin Gly 
85 90 95 



Asp Gly Gly Ser Ala His Ala Phe lie Ser Thr Gly Gly Arg Gly He 
100 105 110 



Ser Thr Arg Leu His Leu His Arg Arg Thr Gly Asp Gin His Thr Pro 
115 120 125 



Ser Ser Pro Gin Gly Asp Arg Gly Ser Ala His Thr Phe He Ser Thr 
130 135 140 



Gly Gly Trp Gly He Ser Thr His Leu His Leu His Arg Gly Met Gly 
145 150 155 160 



Asp Gin His Thr Pro Ser Ser Pro Gin Gly Asp Gly Gly Ser Ala His 
165 170 175 



Ala Phe He Ser Thr Gly Gly Trp Gly He Ser Thr Arg Leu His Leu 
180 185 190 



His Ser Gly Met Ala Asp Gin His Thr Pro Ser Ser Pro Gin Gly Asp 
195 200 205 



Gly Gly Ser Ala His Thr Phe He Ser Thr Gly Gly Trp Gly He Ser 
210 215 220 



Thr Arg Leu His Leu His Ser Gly Met Ala Asp Gin His Thr Pro Ser 
225 230 235 240 



Ser Pro Gin Gly Asp Gly Gly Ser Ala His Ala Phe He Ser Thr Val 
245 250 . 255 



Gly Arg Gly He Ser Thr His Leu His Leu His Arg Gly Thr Gly Asp 
260 265 270 



Gin His Thr Pro Pro Ser Pro Gin Gly His Glu Glu Ala Ala His Thr 
275 280 285 
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Phe He Ser Thr Gly Gly Arg Gly He Ser Thr His Leu His Leu His 
290 295 300 



Arg Gly Met Gly Asp Gin His Thr Pro Pro Ser Pro Gin Gly Asp Lys 
305 310 315 320 



Arg Ser Ala His Thr Phe He Pro Thr Gly Gly Gin Gly He Ser He 
325 330 335 



Pro Leu His Leu His Arg Gly Met Gly Asp Gin His Thr Pro Ser Ser 
340 345 350 



Pro Gin Gly Asp Gly Gly Ser Ala Tyr Pro Phe He Ser Thr Gly Gly 
355 360 365 



Trp Gly He Ser Thr His Leu His Pro His Arg Gly Met Gly Asp Gin 
370 375 380 



His Thr Pro Pro Ser Pro Gin Gly His Glu Glu Ser Ala His Thr Phe 
385 390 395 400 



He Ser Thr Gly Arg Arg Gly He Ser Thr Pro Leu His Leu His Arg 
405 410 415 



Gly Met Gly Asp Gin His Thr Pro Ser Ser Pro Gin Gly Asp Gly Gly 
420 425 430 



Ser Ala Val His Thr Phe He Lys He Gly Glu Gin Gly He Ser Thr 
435 440 445 



His Leu Tyr Leu His Arg Gly Thr Arg Asp Gin His Thr Pro Pro Ser 
450 455 460 



Pro Gin Gly Met Gly Asp Gin His Thr Pro Ser Ser Pro Gin Gly Asp 
465 • 470 475 480 



Gly Asp Gin His Thr Pro Ser Ser Pro Gin Gly Asp Gly Gly Ser Thr 
485 490 495 



His Pro Phe He Ser Thr Gly Asp Gly Gly Ser Ala His Thr Phe He 
500 505 510 



Ser Thr Gly Gly Arg Gly He Ser Thr Arg Leu His Val His Arg Gly 
515 520 525 
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Thr Gly Asp Gin His Thr Pro Ser Ser Ser Gin Gly Asp Gly Gly Ser 
530 535 540 



Ala His Thr Phe He Ser Thr Gly Gly Arg Gly Ser Ala His Thr He 
545 550 555 560 



Ser Thr Gly Gly Gin Gly He Asn Thr Pro Leu His Leu His Met Gly 
565 570 575 



Met Gly Asp Gin His Thr Pro Ser Ser Pro Gin Gly Asp Gly Asp Gin 
580 585 590 



His Thr Pro Pro Ser Pro Gin Gly Arg Gly Gly Leu Ala His Pro Phe 
595 600 605 



He Ser Thr Gly Arg Trp Gly He Ser Thr His Leu His Leu His Arg 
610 615 620 



Gly Thr Gly Asp Gin His Thr Pro Ser Ser Pro Gin Trp Asp Arg Gly 
625 630 635 640 



Ser Ala Tyr Pro Phe He Ser Thr Gly Gly Trp Gly Ser Ala His Thr 
645 650 655 



Phe He Ser Thr Glu Glu Met Gly Asp Gin His Ala Pro Ser Ser Pro 
660 665 670 



Gin Gly His Gly Gly Ser Ala His Thr Phe He Ser Thr Gly Gly Arg 
675 680 685 



Gly He Ser Thr His Leu His Pro Asp Arg Gly Met Arg Asn Gin His 
690 695 700 



Thr Pro Ser Ser Arg Gin Gly Asp Gly Met Gly Asp Gin His Thr Pro 
705 710 715 720 



Pro Ser Pro Gin Gly His Glu Gly Ala Ala His Thr Ser He Ser Thr 
725 730 735 



Gly His Arg Gly Ser Ala His Thr Ser Phe Ser Thr Gly Ala Gin Ala 
740 745 750 



He Ser Thr Tyr Leu His Leu Asp Arg Val Thr Gly Asp Gin His Thr 
755 760 765 



Pro Pro Ser Pro Gin Gin Gin Glu Glu Ser Thr His Thr Phe He Ser 
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770 775 780 



Thr Gly Gly Arg Gly He Ser Thr His Leu His Leu His Arg Gly Thr 
785 790 795 800 



Gly Ala Arg Leu Pro Thr Pro Leu Gly Asp Thr 
805 810 



<210> 147 

<211> 442 

<212> PRT 

<213> Homo sapien 

<400> 147 

Phe Arg Val Met Thr Asp He Leu Phe Leu Pro Mec Trp He Ser Asn 
15 10 15 



Gin His Thr Pro Ser Ser Pro Gin Gly Asp Gly Gly Ser Ala His Thr 
20 25 30 



Phe He Ser Thr Gly Gly Pro Gly He Ser Thr Arg Leu His Leu His 
35 40 45 



Arg Gly Met Gly Asp Gin His Thr Pro Ser Ser Pro Gin Trp Asp Gly 
50 55 60 



Gly Ser Ala His Ala Phe He Ser Thr Gly Gly Trp Gly Met Ser Thr 
65 70 75 80 



Arg Leu His Leu His Arg Gly Met Ala Asp Gin His Thr Pro Ser Ser 
85 90 95 



Pro Gin Gly Asp Gly Gly Ser Ala His Ala Phe He Ser Thr Gly Gly 
100 105 110 



Arg Gly He Ser Thr Arg Leu His Leu His Arg Arg Thr Gly Asp Gin 
115 120 125 



His Thr Pro Ser Ser Pro Gin Gly Asp Arg Gly Ser Ala His Thr Phe 
130 135 140 



He Ser Thr Gly Gly Trp Gly He Ser Thr His Leu His Leu His Arg 
145 150 155 160 



Gly Met Gly Asp Gin His Thr Pro Ser Ser Pro Gin Gly Asp Gly Gly 
165 170 175 
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Ser Ala His Ala Phe lie Ser Thr Gly Gly Trp Gly He Ser Thr Arg 
180 185 190 



Leu His Leu His Ser Gly Met Ala Asp Gin His Thr Pro Ser Ser Pro 
195 200 205 



Gin Gly Asp Gly Gly Ser Ala His Thr Phe He Ser Thr Gly Gly Trp 
210 215 220 



Gly He Ser Thr Arg Leu His Leu His Ser Gly Met Ala Asp Gin His 
225 230 235 240 



Thr Pro Ser Ser Pro Gin Gly Asp Gly Gly Ser Ala His Ala Phe He 
245 250 255 



Ser Thr Val Gly Arg Gly He Ser Thr His Leu His Leu His Arg Gly 
260 265 270 



Thr Gly Asp Gin His Thr Pro Pro Ser Pro Gin Gly His Glu Glu Ala 
275 280 285 



Ala His Thr Phe He Ser Thr Gly Gly Arg Gly He Ser Thr His Leu 
290 295 300 



His Leu His Arg Gly Met Gly Asp Gin His Thr Pro Pro Ser Pro Gin 
305 310 315 320 



Gly Asp Lys Arg Ser Ala His Thr Phe He Pro Thr Gly Gly Gin Gly 
325 330 335 



He Ser He Pro Leu His Leu His Arg Gly Met Gly Asp Gin His Thr 
340 345 350 



Pro Ser Ser Pro Gin Gly Asp Gly Gly Ser Ala Tyr Pro Phe He Ser 
355 360 365 



Thr Gly Gly Trp Gly He Ser Thr His Leu His Pro His Arg Gly Met 
370 375 380 



Gly Asp Gin His Thr Pro Pro Ser Pro Gin Gly His Glu Glu Ser Ala 
385 390 395 400 



His Thr Phe He Ser Thr Gly Arg Arg Gly He Ser Thr Pro Leu His 
405 410 , 415 
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Leu His Arg Gly Met Gly Asp Gin His Thr Pro Ser Ser Pro Gin Gly 
420 425 430 



Asp Gly Gly Ser Ala Val His Thr Phe lie 

440 





435 


<210> 


148 


<211> 


351 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


148 



Met Lys Ala Ser Gly Thr Leu Arg Glu Tyr Lys Val Val Gly Arg Cys 
15 10 15 



Leu Pro Thr Pro Lys Cys His Thr Pro Pro Leu Tyr Arg Met Arg lie 
20 25 30 



Phe Ala Pro Asn His Val Val Ala Lys Ser Arg Phe Trp Tyr Phe Val 
35 40 45 



Ser Gin Leu Lys Lys Met Lys Lys Ser Ser Gly Glu He Val Tyr Cys 
50 55 60 



Gly Gin Val Phe Glu Lys Ser Pro Leu Arg Val Lys Asn Phe Gly He 
65 70 75 80 



Trp Leu Arg Tyr Asp Ser Arg Ser Gly Thr His Asn Met Tyr Arg Glu 
85 90 95 



Tyr Arg Asp Leu Thr Thr Ala Gly Ala Val Thr Gin Cys Tyr Arg Asp 
100 105 110 



Met Gly Ala Arg His Arg Ala Arg Ala His Ser lie Gin He Met Lys 
115 120 125 



Val Glu Glu He Ala Ala Ser Lye Cys Arg Arg Pro Ala Val ^ Lys Gin 
130 135 140 



Phe His Asp Ser Lys He Lys Phe Pro Leu Pro His Arg Val Leu Arg 
145 150 155 160 



Arg Gin His Lys Pro Arg Phe Thr Thr Lys Arg Pro Asn Asn Leu Leu 
165 170 175 



Ser Arg Cys Arg Ala Leu Val Arg Gly Val Pro Pro Asn Lys Leu Arg 
180 185 190 
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Asn Ala Pro Lys Val Gin Ala Ala Val Pro Asp Asp Phe Lys Asp Phe 
195 200 205 



Ser Leu Leu Asn Glu Glu Ala Arg Tyr Tyr Gin Phe Lys Thr Met Val 
210 215 220 



Arg Arg Ala Trp Ser Ala Gly Thr His Asp Pro Glu Lys Ser Thr Gly 
225 230 235 240 



Asn Arg Asp Gly Glu Arg Leu Asp Ala Lys Ser Ser Ala Arg Arg Trp 
245 250 255 



Ala Lys Arg Asp Arg Thr Thr Arg Arg Ala Leu Pro Ala Glu Glu Glu 
260 265 270 



Tyr His Ser Asn Ala Lys Ala Thr Val Arg Gin Asn Lys Pro Arg Arg 
275 280 285 



His Gin Ser Gly Ala Lys Glu Lys Lys Gin His Asn Glu His Ala Ala 
290 295 300 



Ala Gin Tyr Ala Ala Arg Ser Lys Glu Thr Asp Arg Lys Gin Pro Val 
305 310 315 320 



Gly Asp Asn Gin Gly Glu Thr Lys Pro Pro Gly Arg Lys Arg Glu Gly 
325 330 335 



Glu Glu Arg Thr Ala Gly Pro Asn Lys Glu Arg Asn Ser Arg His 
340 345 350 



<210> 149 

<211> 223 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MISC^FEATXmE 

<222> (4).. (4) 

<223> Xs=any amino acid 



<400> 149 

Ala Phe Ala Xaa Gly Gly Glu Arg Gly Glu His Ala Met Lys Ala Ser 
15 10 15 



Gly Thr Leu Arg Glu Tyr Lys Val Val Gly Arg Cys Leu Pro Thr Pro 
20 25 30 
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Lys Cys His Thr Pro Pro Leu Tyr Arg Met Arg lie Phe Ala Pro Asn 
35 40 45 



His Val Val Ala Lys Ser Arg Phe Trp Tyr Phe Val Ser Gin Leu Lys 
50 55 60 



Lys Met Lys Lys Ser Ser Gly Glu He Val Tyr Cys Gly Gin Val Phe 
65 70 75 80 



Glu Lys Ser Pro Leu Arg Val Lys Asn Phe Gly He Trp Leu Arg Tyr 
85 90 95 



Asp Ser Arg Ser Gly Thr His Asn Met Tyr Arg Glu Tyr Arg Asp Leu 
100 105 110 



Thr Thr Ala Gly Ala Val Thr Gin Cys Tyr Arg Asp Met Gly Ala Arg 
115 120 125 



His Arg Ala Arg Ala His Ser He Gin He Met Lys Val Glu Glu He 
130 135 140 



Ala Ala Ser Lys Cys Arg Arg Pro Ala Val Lys Gin Phe His Asp Ser 
145 150 155 160 



Lys He Lys Phe Pro Leu Pro His Arg Val Leu Arg Arg Gin His Lys 
165 170 175 



Pro Arg Phe Thr Thr Lys Arg Pro Asn Asn Leu Leu Ser Arg Cys Arg 
180 185 190 



Ala Leu Val Arg Gly Val Pro Pro Asn Lys Leu Arg Asn Ala Pro Lys 
195 200 205 



Val Gin Ala Ala Val Pro Asp Asp Phe Lys Asp Phe Ser Leu Leu 
210 215 220 



<210> 150 

<211> 260 

<212> PRT 

<213> Homo sapien 

<400> 150 

Thr Ala Val Leu Ser Pro Gly Pro Arg Leu Pro Ser His Ser Ala Arg 
15 10 15 
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Cys Ala Cys Glu Gly Leu Ala Ala Leu Gly Thr Gly Gly Ala Ala Arg 
20 25 30 



Gly Val Arg Val Gly Val Arg Glu Gly Ser Thr Gin Asp Leu Arg Thr 
35 40 45 



Leu Leu Trp Gly Arg Thr Lys His Leu Pro Gly Ala Gly Gly Ala Pro 
50 55 60 



Gly Thr Arg Arg Phe Arg Gin Leu Gly Ala Leu Gly lie Cys Gly Leu 
65 70 75 80 



Arg Pro Gly Asp Gly Leu Gly Gly His Ala His Ala Leu Gly Leu Thr 
85 90 95 



Glu Cys Asp Arg Ala Arg Gly Arg Ala Lys Arg Gly Gly Arg Ala Arg 
100 105 110 



Arg Arg Lys Glu Gly Leu Val Arg Pro Ala Gin Pro Asp Gin Cys Arg 
115 120 125 



Gly Gly Asn Gly Leu Gly Ala Gly Pro lie Arg Ala Gly Gly Phe Leu 
130 135 140 



Arg Arg Arg Pro Ser Pro Gin Leu Leu Asp Cys Ser Gly Ala Gly Gly 
145 150 155 160 



Thr Asn Thr Trp Arg Phe Phe Arg Arg Gly Glu Asp Phe Leu Arg Ala 
165 170 175 



Gin Arg lie His Phe Leu His lie Asn Leu Ser Cys Trp Arg Asp Thr 
180 185 190 



Ala Gly Lys Arg Arg Pro lie Phe Val Gin Arg Thr Leu Asp Leu Gly 
195 200 205 



Arg Asn Lys Asp Asp Leu Asp Pro Cys Pro His Tyr Leu Glu Phe Ser 
210 215 220 



Met Leu Ala Lys lie Trp Thr Arg Ala Val Pro Glu Gly Arg Gly Pro 
225 230 235 240 



Trp Arg Glu Ala Pro Val Thr Ala His Pro Gly Val Gly Leu Trp Ala 
245 250 255 



Leu Leu Leu Cys 
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260 



<210> 151 

<211> 259 

<212> PRT 

<213> Homo sapien 

<400> 151 

Ser Arg Val Val Ala Arg Pro Arg Leu Pro Ser His Ser Ala Arg Cys 
15 10 15 



Ala Cys Glu Gly Leu Ala Ala Leu Gly Thr Gly Gly Ala Ala Arg Gly 
20 25 30 



Val Arg Val Gly Val Arg Glu Gly Ser Thr Gin Asp Leu Arg Thr Leu 
35 40 45 



Leu Trp Gly Arg Thr Lys His Leu Pro Gly Ala Gly Gly Ala Pro Gly 
50 55 60 



Thr Arg Arg Phe Arg Gin Leu Gly Ala Leu Gly He Cys Gly Leu Arg 
65 70 75 80 



Pro Gly Asp Gly Leu Gly Gly His Ala His Ala Leu Gly Leu Thr Glu 
85 90 95 



Cys Asp Arg Ala Arg Gly Arg Ala Lys Arg Gly Gly Arg Ala Arg Arg 
100 105 110 



Arg Lys Glu Gly Leu Val Arg Pro Ala Gin Pro Asp Gin Cys Arg Gly 
115 120 125 



Gly Asn Gly Leu Gly Ala Gly Pro He Arg Ala Gly Gly Phe Leu TVrg 
130 135 140 



Arg Arg Pro Ser Pro Gin Leu Leu Asp Cys Ser Gly Ala Gly Gly Thr 
145 150 155 160 



Asn Thr Trp Arg Phe Phe Arg Arg Gly Glu Asp Phe Leu Arg Ala Gin 
165 170 175 



Arg He His Phe Leu His He Asn Leu Ser Cys Trp Arg Asp Thr Ala 
180 185 190 



Gly Lys Arg Arg Pro He Phe Val Gin Arg Thr Leu Asp Leu Gly Arg 
195 200 205 
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Asn Lys Asp Asp Leu Asp Pro Cys Pro His Tyr Leu Glu Phe Ser Met 
210 215 220 



Leu Ala Lys He Trp Thr Arg Ala Val Pro Glu Gly Arg Gly Pro Trp 
225 230 235 240 



Arg Glu Ala Pro Val Thr Ala His Pro Gly Val Gly Leu Trp Ala Leu 
245 250 255 



Leu Leu Cys 



<210> 152 

<211> 650 

<212> PRT 

<213> Homo sapien 

<400> 152 

Met Gin Gin Asp Gly Leu Gly Val Gly Thr Arg Asn Gly Ser Gly Lys 
^5 10 15 



Gly Arg Ser Val His Pro Ser Trp Pro Trp Cys Ala Pro Arg Pro Leu 
20 25 30 



Arg Tyr Phe Gly Arg Asp Ala Arg Ala Arg Arg Ala Gin Thr Ala Ala 
35 40 45 



Met Ala Leu Leu Ala Gly Gly Leu Ser Arg Gly Leu Gly Ser His Pro 
50 55 60 



Ala Ala Ala Gly Arg Asp Ala Val Val Phe Val Trp Leu Leu Leu Ser 
^5 70 75 80 



Thr Trp Cys Thr Ala Pro Ala Arg Ala He Gin Val Thr Val Ser Asn 
85 90 95 



Pro Tyr His Val Val He Leu Phe Gin Pro Val Thr Leu Pro Cys Thr 
100 105 110 



Tyr Gin Met Thr Ser Thr Pro Thr Gin Pro He Val He Trp Lys Tyr 
115 120 125 



Lys Ser Phe Cys Arg Asp Arg He Ala Asp Ala Phe Ser Pro Ala Ser 
130 135 140 



Val Asp Asn Gin Leu Asn Ala Gin Leu Ala Ala Gly Asn Pro Gly Tyr 
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145 



150 



155 



160 



Asn Pro Tyr Val Glu Cys Gin Asp Ser Val Arg Thr Val Arg Val Val 
165 170 175 



Ala Thr Lys Gin Gly Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gin Gly 
180 185 190 



Arg Arg He Thr He Thr Gly Asn Ala Asp Leu Thr Phe Asp Gin Thr 
195 200 205 



Ala Trp Gly Asp Ser Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gin 
210 215 220 



Asp Leu Gin Gly Asn Asn Glu Ala Tyr Ala Glu Leu He Val Leu Gly 
225 230 235 240 



Arg Thr Ser Gly Val Ala Glu Leu Leu Pro Gly Phe Gin Ala Gly Pro 
245 250 255 



He Glu Asp Trp Leu Phe Val Val Val Val Cys Leu Ala Ala Phe Leu 
260 265 270 



He Phe Leu Leu Leu Gly He Cys Trp Cys Gin Cys Cys Pro His Thr 
275 280 285 



Cys Cys Cys Tyr Val Arg Cys Pro Cys Cys Pro Asp Lys Cys Cys Cys 
290 295 300 



Pro Glu Ala Leu Tyr Ala Ala Gly Lys Ala Ala Thr Ser Gly Val Pro 
305 310 315 320 



Ser He Tyr Ala Pro Ser Thr Tyr Ala His Leu Ser Pro Ala Lys Thr 
325 330 335 



Pro Pro Pro Pro Ala Met He Pro Met Gly Pro Ala Tyr Asn Gly Tyr 
340 345 350 



Pro Gly Gly Tyr Pro Gly Asp Val Asp Arg Ser Ser Ser Ala Gly Gly 
355 360 365 



Gin Gly Ser Tyr Val Pro Leu Leu Arg Asp Thr Asp Ser Ser Val Ala 
370 375 380 



Ser Glu Val Arg Ser Gly Tyr Arg He Gin Ala Ser Gin Gin Asp Asp 
385 390 395 400 
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Ser Met Arg Val Leu Tyr Tyr Met Glu Lys Glu Leu Ala Asn Phe Asp 
405 410 415 



Pro Ser Arg Pro Gly Pro Pro Ser Gly Arg Val Glu Arg Ala Met Ser 
420 425 430 



Glu Val Thr Ser Leu His Glu Asp Asp Trp Arg Ser Arg Pro Ser Arg 
435 440 445 



Gly Pro Ala Leu Thr Pro lie Arg Asp Glu Glu Trp Gly Gly His Ser 
450 455 460 



Pro Arg Ser Pro Arg Gly Trp Asp Gin Glu Pro Ala Arg Glu Gin Ala 
465 470 475 480 



Gly Gly Gly Trp Arg Ala Arg Arg Pro Arg Ala Arg Ser Val Asp Ala 
485 490 495 



Leu Asp Asp Leu Thr Pro Pro Ser Thr Ala Glu Ser Gly Ser Arg Ser 
500 505 510 



Pro Thr Ser Asn Gly Gly Arg Arg Ser Arg Ala Tyr Met Pro Pro Arg 
515 520 525 



Ser Arg Ser Arg Asp Asp Leu Tyr Asp Gin Asp Asp Ser Arg Asp Phe 
530 535 540 



Pro Arg Ser Arg Asp Pro His Tyr Asp Asp Phe Arg Ser Arg Glu Arg 
545 550 555 560 



Pro Pro Ala Asp Pro Arg Ser His His His Arg Thr Arg Asp Pro Arg 
565 570 575 



Asp Asn Gly Ser Arg Ser Gly Asp Leu Pro Tyr Asp Gly Arg Leu Leu 
580 585 590 



Glu Glu Ala Val Arg Lys Lys Gly Ser Glu Glu Arg Arg Arg Pro His 
595 600 605 



Lys Glu Glu Glu Glu Glu Ala Tyr Tyr Pro Pro Ala Pro Pro Pro Tyr 
610 615 620 



Ser Glu Thr Asp Ser Gin Ala Ser Arg Glu Arg Arg Leu Lys Lys Asn 
625 630 635 640 
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Leu Ala Leu Ser Arg Glu Ser Leu Val Val 
645 650 



<210> 153 

<211> 388 

<212> PRT 

<213> Homo sapien 

<400> 153 

Met Ser Lys Glu Ala Leu Gin Arg Arg Gly Arg Leu Gly Lys Glu Val 
15 10 15 



Gin Ala Gin Val Pro Pro Glu Pro Asn Gly Tyr Gly Ala Ala Trp Leu 
20 25 30 



Leu Pro His Pro Pro Ser Pro Val Asp Cys Val Leu Thr Val Tyr Ala 
35 40 45 



Ala Gly Lys Ala Ala Thr Ser Gly Val Pro Ser lie Tyr Ala Pro Ser 
50 55 60 



Thr Tyr Ala His Leu Ser Pro Ala Lys Thr Pro Pro Pro Pro Ala Met 
65 70 75 80 



lie Pro Met Gly Pro Ala Tyr Asn Gly Tyr Pro Gly Gly Tyr Pro Gly 
85 90 95 



Asp Val Asp Arg Ser Ser Ser Ala Gly Gly Gin Gly Ser Tyr Val Pro 
100 105 110 



Leu Leu Arg Asp Thr Asp Ser Ser Val Ala Ser Glu Val Arg Ser Gly 
115 120 125 



Tyr Arg lie Gin Ala Ser Gin Gin Asp Asp Ser Met Arg Val Leu Tyr 
130 135 140 



Tyr Met Glu Lys Glu Leu Ala Asn Phe Asp Pro Ser Arg Pro Gly Pro 
145 150 155 160 



Pro Ser Gly Arg Val Glu Arg Ala Met Ser Glu Val Thr Ser Leu His 
165 170 175 



Glu Asp Asp Trp Arg Ser Arg Pro Ser Arg Gly Pro Ala Leu Thr Pro 
180 185 190 



He Arg Asp Glu Glu Trp Gly Gly His Ser Pro Arg Ser Pro Arg Gly 
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195 200 205 



Trp Asp Gin Glu Pro Ala Arg Glu Gin Ala Gly Gly Gly Trp Arg Ala 
210 215 220 



Arg Arg Pro Arg Ala Arg Ser Val Asp Ala Leu Asp Asp Leu Thr Pro 
225 230 235 240 



Pro Ser Thr Ala Glu Ser Gly Ser Arg Ser Pro Thr Ser Asn Gly Gly 
245 250 255 



Arg Arg Ser Arg Ala Tyr Met Pro Pro Arg Ser Arg Ser Arg Asp Asp 
260 265 270 



Leu Tyr Asp Gin Asp Asp Ser Arg Asp Phe Pro Arg Ser Arg Asp Pro 
275 280 285 



His Tyr Asp Asp Phe Arg Ser Arg Glu Arg Pro Pro Ala Asp Pro Arg 
290 295 300 



Ser His His His Arg Thr Arg Asp Pro Arg Asp Asn Gly Ser Arg Ser 
305 310 315 320 



Gly Asp Leu Pro Tyr Asp Gly Arg Leu Leu Glu Glu Ala Val Arg Lys 
325 330 335 



Lys Gly Ser Glu Glu Arg Arg Arg Pro His Lys Glu Glu Glu Glu Glu 
340 345 350 



Ala Tyr Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp Ser Gin 
355 360 365 



Ala Ser Arg Glu Arg Arg Leu Lys Lys Asn Leu Ala Leu Ser Arg Glu 
370 375 380 



Ser Leu Val Val 
385 



<210> 154 

<211> 83 

<212> PRT 

<213> Homo sapien 

<400> 154 

Met Lys Pro Gly Glu Gly Gly Gin Val Ala Pro Ser Leu Pro Gly Ser 
15 10 15 
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Gly Gin Thr Cys Leu Glu Ser Gin Gly Arg Thr Arg Ser Ser Asn Pro 
20 25 30 



Pro Thr Ala Pro Ser Arg Leu Pro Ala Arg Pro Thr Ser His Ser Leu 
35 40 45 



Gly Ser His Gly Ala Asp Arg Pro Arg Arg Glu His Thr Pro Pro Val 
50 55 60 



Cys Ala Leu Ser Arg Ser Gin Arg Pro Arg Gly His Arg Ala Met His 
65 70 75 80 



Ala Pro Asn 



<210> 155 

<21l> 379 

<212> PRT 

<213> Homo sapien 

<400> 155 

Ala Ser His Leu Leu Pro Gin Ala Pro Thr Ala Ser Pro Cys Val Leu 
15 10 15 



Gin Glu Thr Tyr Lys Leu Pro His Arg Leu lie Glu Lys Lys Arg Arg 
20 25 30 



Asp Arg lie Asn Glu Cys lie Ala Gin Leu Lys Asp Leu Leu Pro Glu 
35 40 45 



His Leu Lys Leu Thr Thr Leu Gly His Leu Glu Lys Ala Val Val Leu 
50 55 60 



Glu Leu Thr Leu Lys His Val Lys Ala Leu Thr Asn Leu lie Asp Gin 
65 70 75 80 



Gin Gin Gin Lys lie lie Ala Leu Gin Ser Gly Leu Gin Ala Gly Glu 
85 90 95 



Leu Ser Gly Arg Asn Val Glu Thr Gly Gin Glu Met Phe Cys Ser Gly 
100 105 110 



Phe Gin Thr Cys Ala Arg Glu Val Leu Gin Tyr Leu Ala Lys His Glu 
115 120 125 



Asn Thr Arg Asp Leu Lys Ser Ser Gin Leu Val Thr His Leu His Arg 
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191 

135 140 



Val val Ser Glu Leu Leu Gin Gly Gly Thr Ser Arg Lys Pro Ser Asp 
145 150 155 160 



Pro Ala Pro Lys Val Met Asp Phe Lys Glu Lys Pro Ser Ser Pro Ala 
165 170 175 



Lys Gly Ser Glu Gly Pro Gly Lys Asn Cys Val Pro Val He Gin Arg 
180 185 190 



Thr Phe Ala His Ser Ser Gly Glu Gin Ser Gly Ser Asp Thr Asp Thr 
195 200 205 



Asp Ser Gly Tyr Gly Gly Glu Ser Glu Lys Gly Asp Leu Arg Ser Glu 
210 215 220 



Gin Pro Cys Phe Lys Ser Asp His Gly Arg Arg Phe Thr Met Gly Glu 
225 230 235 240 



Arg He Gly Ala He Lys Gin Glu Ser Glu Glu Pro Pro Thr Lys Lys 
245 250 255 



Asn Arg Met Gin Leu Ser Asp Asp Glu Gly His Phe Thr Ser Ser Asp 
260 265 270 



Leu He Ser Ser Pro Phe Leu Gly Pro His Pro His Gin Pro Pro Phe 
275 280 285 



Cys Leu Pro Phe Tyr Leu He Pro Pro Ser Ala Thr Ala Tyr Leu Pro 
290 295 300 



Met Leu Glu Lys Cys Trp Tyr Pro Thr Ser Val Pro Val Leu Tyr Pro 
305 310 315 320 



Gly Leu Asn Ala Ser Ala Ala Ala Leu Ser Ser Phe Met Asn Pro Asp 
325 330 335 



Lys He Ser Ala Pro Leu Leu Met Pro Gin Arg Leu Pro Ser Pro Leu 
340 345 350 



Pro Ala His Pro Ser Val Asp Ser Ser Val Leu Leu Gin Ala Leu Lys 
355 360 365 



Pro He Pro Pro Leu Asn Leu Glu Thr Lys Asp 
370 375 
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<210> 156 

<211> 379 

<212> PRT 

<213> Homo sapien 

<400> 156 

Ala Ser His Leu Leu Pro Gin Ala Pro Thr Ala Ser Pro Cys Val Leu 
15 10 15 



Gin Glu Thr Tyr Lys Leu Pro His Arg Leu lie Glu Lys Lys Arg Arg 
20 25 30 



Asp Arg lie Asn Glu Cys lie Ala Gin Leu Lys Asp Leu Leu Pro Glu 
35 40 45 



His Leu Lys Leu Thr Thr Leu Gly His Leu Glu Lys Ala Val Val Leu 
50 55 60 



Glu Leu Thr Leu Lys His Val Lys Ala Leu Thr Asn Leu lie Asp Gin 
65 70 75 80 



Gin Gin Gin Lys He He Ala Leu Gin Ser Gly Leu Gin Ala Gly Glu 
85 90 95 



Leu Ser Gly Arg Asn Val Glu Thr Gly Gin Glu Met Phe Cys Ser Gly 
100 105 110 



Phe Gin Thr Cys Ala Arg Glu Val Leu Gin Tyr Leu Ala Lys His Glu 
115 120 125 



Asn Thr Arg Asp Leu Lys Ser Ser Gin Leu Val Thr His Leu His Arg 
130 135 140 



Val Val Ser Glu Leu Leu Gin Gly Gly Thr Ser Arg Lys Pro Ser Asp 
145 150 155 160 



Pro Ala Pro Lys Val Met Asp Phe Lys Glu Lys Pro Ser Ser Pro Ala 
165 170 175 



Lys Gly Ser Glu Gly Pro Gly Lys Asn Cys Val Pro Val He Gin Arg 
180 185 190 



Thr Phe Ala His Ser Ser Gly Glu Gin Ser Gly Ser Asp Thr Asp Thr 
195 200 205 
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Asp Ser Gly Tyr Gly Gly Glu Ser Glu Lys Gly Asp Leu Arg Ser Glu 
210 215 220 



Gin Pro Cys Phe Lys Ser Asp His Gly Arg Arg Phe Thr Met Gly Glu 
225 230 235 240 



Arg lie Gly Ala He Lys Gin Glu Ser Glu Glu Pro Pro Thr Lys Lys 
245 250 255 



Asn Arg Met Gin Leu Ser Asp Asp Glu Gly His Phe Thr Ser Ser Asp 
260 265 270 



Leu lie Ser Ser Pro Phe Leu Gly Pro His Pro His Gin Pro Pro Phe 
275 280 285 



Cys Leu Pro Phe Tyr Leu He Pro Pro Ser Ala Thr Ala Tyr Leu Pro 
290 295 300 



Met Leu Glu Lys Cys Trp Tyr Pro Thr Ser Val Pro Val Leu Tyr Pro 
305 310 315 320 



Gly Leu Asn Ala Ser Ala Ala Ala Leu Ser Ser Phe Met Asn Pro Asp 
325 330 335 



Lys He Ser Ala Pro Leu Leu Met Pro Gin Arg Leu Pro Ser Pro Leu 
340 345 350 



Pro Ala His Pro Ser Val Asp Ser Ser Val Leu Leu Gin Ala Leu Lys 
355 360 365 



Pro He Pro Pro Leu Asn Leu Glu Thr Lys Asp 
370 375 



<210> 157 

<211> 358 

<212> PRT 

<213> Homo sapien 

<400> 157 

Met Lys Pro Gly Glu Gly Gly Gin Val Ala Pro Ser Leu Pro Gly Ser 
15 10 15 



Gly Gin Thr Cys Leu Glu Ser Gin Gly Arg Thr Arg Ser Ser Asn Pro 
20 25 30 



Pro Thr Ala Pro Ser Arg Leu Pro Ala Leu Pro His Phe Ser Phe Thr 
35 40 45 
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Trp Leu Ala Arg Arg Arg Gin Thr Ala Gin Gly Ala His Thr Ala Ser 
50 55 60 



Leu Cys Ala Glu Ser Glu Pro Glu Ala Ala Gly Thr Pro Gly His Ala 
65 70 75 80 



Arg Pro Gin Leu Lys Leu His Leu Lys Ala Glu Asp Ser Ser Ser Pro 
85 90 95 



Gly Asp Phe Lys Glu Leu Arg Leu Arg Gly Thr Ser Ala Glu Arg Pro 
100 105 110 



Pro Lys Pro Ser Pro Gly Gin Ser Ser Ser Arg Arg Ser Ala Ser Ala 
115 120 125 



Asp Arg Se'r Ala Gin Trp Pro Arg Leu Ala Ala Pro Trp Ser Gly Ser 
130 135 140 



Pro Ala Arg Asn His Pro Pro Pro Ala Cys Pro Lys His Arg Asp Trp 
145 150 155 160 



Ser Thr Glu Thr Tyr Gin Gly Lys Leu Ala Leu Leu Gly Pro Ser Ser 
165 170 175 



Leu Asn Cys Ser Pro Met Leu Cys Ala Thr Leu Asn Leu Glu Gin Leu 
180 185 190 



Arg Ala His Arg Glu Val Leu Ala Arg Gin Asn Ala Cys Ser Arg Ala 
195 200 205 



Gin Ala Val Thr Thr Leu Pro Gly Leu Ser Ser Cys Arg Met Tyr Pro 
210 215 220 



Ala His Met Tyr Gin Val Tyr Lys Ser Arg Arg Gly He Lys Arg Ser 
225 230 235 240 



Glu Asp Ser Lys Val Ser Lys Cys Thr Pro Arg Asp Pro Ala Leu Ser 
245 250 255 



Pro Ser Arg Ala Leu Ser Phe Gin Glu Lys Phe Ser Arg Phe Glu Val 
260 265 270 



Gly Glu Gly Met Gin Gly Val Gly Ser Val Pro Leu Leu Ser Asp Leu 
275 280 285 



wo 2004/050860 



PCT/US2003/040063 



195 

Glu Lys Lys Gly Gin Thr Met Val Leu Gly Ala Thr Leu Leu Leu Cys 
290 295 300 



Ser Ser Ala Gly Leu Leu Leu Arg Gly Trp Glu Asp Arg Leu Leu lie 
305 310 315 320 



Ser Phe Pro Lys Arg Pro Pro Pro Pro Arg Ala Ser Cys Arg Arg Pro 
325 330 335 



Thr Asn Cys Arg Thr Gly Ser Ser Arg Lys Arg Asp Val Thr Gly Leu 
340 345 350 



Thr Ser Ala Ser Pro Ser 





355 


<210> 


158 


<211> 


329 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


158 



Leu Gin Pro Thr His Arg Ser Leu Pro Pro Pro Arg Pro Pro His Phe 
15 10 15 



Ser Phe Thr Trp Leu Ala Arg Arg Arg Gin Thr Ala Gin Gly Ala His 
20 25 30 



Thr Ala Ser Leu Cys Ala Glu Ser Glu Pro Glu Ala Ala Gly Thr Pro 
35 40 45 



Gly His Ala Arg Pro Gin Leu Lys Leu His Leu Lys Ala Glu Asp Ser 
50 55 60 



Ser Ser Pro Gly Asp Phe Lys Glu Leu Arg Leu Arg Gly Thr Ser Ala 
65 70 75 80 



Glu Arg Pro Pro Lys Pro Ser Pro Gly Gin Ser Ser Ser Arg Arg Ser 
85 90 95 



Ala Ser Ala Asp Arg Ser Ala Gin Trp Pro Arg Leu Ala Ala Pro Trp 
100 105 110 



Ser Gly Ser Pro Ala Arg Asn His Pro Pro Pro Ala Cys Pro Lys His 
115 120 125 



Arg Asp Trp Ser Thr Glu Thr Tyr Gin Gly Lys Leu Ala Leu Leu Gly 
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130 135 140 



Pro Ser Ser Leu Asn Cys Ser Pro Met Leu Cys Ala Thr Leu Asn Leu 
145 150 155 160 



Glu Gin Leu Arg Ala His Arg Glu Val Leu Ala Arg Gin Asn Ala Cys 
165 170 175 



Ser Arg Ala Gin Ala Val Thr Thr Leu Pro Gly Leu Ser Ser Cys Arg 
180 185 190 



Met Tyr Pro Ala His Met Tyr Gin Val Tyr Lys Ser Arg Arg Gly lie 
195 200 205 



Lys Arg Ser Glu Asp Ser Lys Val Ser Lys Cys Thr Pro Arg Asp Pro 
210 215 220 



Ala Leu Ser Pro Ser Arg Ala Leu Ser Phe Gin Glu Lys Phe Ser Arg 
225 230 235 240 



Phe Glu Val Gly Glu Gly Met Gin Gly Val Gly Ser Val Pro Leu Leu 
245 250 255 



Ser Asp Leu Glu Lys Lys Gly Gin Thr Met Val Leu Gly Ala Thr Leu 
260 265 270 



Leu Leu Cys Ser Ser Ala Gly Leu Leu Leu Arg Gly Trp Glu Asp Arg 
275 280 285 



Leu Leu lie Ser Phe Pro Lys Arg Pro Pro Pro Pro Arg Ala Ser Cys 
290 295 300 



Arg Arg Pro Thr Asn Cys Arg Thr Gly Ser Ser Arg Lys Arg Asp Val 
305 310 315 320 



Thr Gly Leu Thr Ser Ala Ser Pro Ser 
325 



<210> 159 

<211> 425 

<212> PRT 

<213> Homo sapien 

<400> 159 

Cys Arg Gin Glu Arg Ala Val Ala Pro Ala Arg Arg Ala Met Glu Arg 
15 10 15 
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lie Pro Ser Ala Gin Pro Pro Pro Ala Cys Leu Pro Lys Ala Pro Gly 
20 25 30 



Leu Glu His Gly Asp Leu Pro Gly Met Tyr Pro Ala His Met Tyr Gin 
35 40 45 



Val Tyr Lys Ser Arg Arg Gly lie Lys Arg Ser Glu Asp Ser Lys Glu 
50 55 60 



Thr Tyr Lys Leu Pro His Arg Leu lie Glu Lys Lys Arg Arg Asp Arg 
65 70 75 80 



lie Asn Glu Cys lie Ala Gin Leu Lys Asp Leu Leu Pro Glu His Leu 
85 90 95 



Lys Leu Thr Thr Leu Gly His Leu Glu Lys Ala Val Val Leu Glu Leu 
100 105 110 



Thr Leu Lys His Val Lys Ala Leu Thr Asn Leu lie Asp Gin Gin Gin 
115 120 125 



Gin Lys lie lie Ala Leu Gin Ser Gly Leu Gin Ala Gly Glu Leu Ser 
130 135 140 



Gly Arg Asn Val Glu Thr Gly Gin Glu Met Phe Cys Ser Gly Phe Gin 
145 150 155 160 



Thr Cys Ala Arg Glu Val Leu Gin Tyr Leu Ala Lys His Glu Asn Thr 
165 170 175 



Arg Asp Leu Lys Ser Ser Gin Leu Val Thr His Leu His Arg Val Val 
180 185 190 



Ser Glu Leu Leu Gin Gly Gly Thr Ser Arg Lys Pro Ser Asp Pro Ala 
195 200 205 



Pro Lys Val Met Asp Phe Lys Glu Lys Pro Ser Ser Pro Ala Lys Gly 
210 215 220 



Ser Glu Gly Pro Gly Lys Asn Cys Val Pro Val He Gin Arg Thr Phe 
225 230 235 240 



Ala His Ser Ser Gly Glu Gin Ser Gly Ser Asp Thr Asp Thr Asp Ser 
245 250 255 
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Gly Tyr Gly Gly Glu Ser Glu Lys Gly Asp Leu Arg Ser Glu Gin Pro 
260 265 270 



Cys Phe Lys Ser Asp His Gly Arg Arg Phe Thr Met Gly Glu Arg He 
275 280 285 



Gly Ala He Lys Gin Glu Ser Glu Glu Pro Pro Thr Lys Lys Asn Arg 
290 295 300 



Met Gin Leu Ser Asp Asp Glu Gly His Phe Thr Ser Ser Asp Leu He 
305 310 315 320 



Ser Ser Pro Phe Leu Gly Pro His Pro His Gin Pro Pro Phe Cys Leu 
325 330 335 



Pro Phe Tyr Leu He Pro Pro Ser Ala Thr Ala Tyr Leu Pro Met Leu 
340 345 350 



Glu Lys Cys Trp Tyr Pro Thr Ser Val Pro Val Leu Tyr Pro Gly Leu 
355 360 365 



Asn Ala Ser Ala Ala Ala Leu Ser Ser Phe Met Asn Pro Asp Lys He 
370 375 380 



Ser Ala Pro Leu Leu Met Pro Gin Arg Leu Pro Ser Pro Leu Pro Ala 
385 390 395 400 



His Pro Ser Val Asp Ser Ser Val Leu Leu Gin Ala Leu Lys Pro He 
405 410 415 



Pro Pro Leu Asn Leu Glu Thr Lys Asp 
420 425 



<210> 160 

<211> 425 

<212> PRT 

<213> Homo sapien 

<400> 160 

Cys Arg Gin Glu Arg Ala Val Ala Pro Ala Arg Arg Ala Met Glu Arg 
15 10 15 



He Pro Ser Ala Gin Pro Pro Pro Ala Cys Leu Pro Lys Ala Pro Gly 
20 25 30 



Leu Glu His Gly Asp Leu Pro Gly Met Tyr Pro Ala His Met Tyr Gin 
35 40 45 
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Val Tyr Lys Ser Arg Arg Gly lie Lys Arg Ser Glu Asp Ser Lys Glu 
50 55 60 



Thr Tyr Lys Leu Pro His Arg Leu lie Glu Lys Lys Arg Arg Asp Arg 
65 70 75 80 



He Asn Glu Cys lie Ala Gin Leu Lys Asp Leu Leu Pro Glu His Leu 
85 90 95 



Lys Leu Thr Thr Leu Gly His Leu Glu Lys Ala Val Val Leu Glu Leu 
100 105 110 



Thr Leu Lys His Val Lys Ala Leu Thr Asn Leu lie Asp Gin Gin Gin 
115 120 125 



Gin Lys He He Ala Leu Gin Ser Gly Leu Gin Ala Gly Glu Leu Ser 
130 135 140 



Gly Arg Asn Val Glu Thr Gly Gin Glu Met Phe Cys Ser Gly Phe Gin 
145 150 155 160 



Thr Cys Ala Arg Glu Val Leu Gin Tyr Leu Ala Lys His Glu Asn Thr 
165 170 175 



Arg Asp Leu Lys Ser Ser Gin Leu Val Thr His Leu His Arg Val Val 
180 IBS 190 



Ser Glu Leu Leu Gin Gly Gly Thr Ser Arg Lys Pro Ser Asp Pro Ala 
195 200 205 



Pro Lys Val Met Asp Phe Lys Glu Lys Pro Ser Ser Pro Ala Lys Gly 
210 215 220 



Ser Glu Gly Pro Gly Lys Asn Cys Val Pro Val He Gin Arg Thr Phe 
225 230 235 240 



Ala His Ser Ser Gly Glu Gin Ser Gly Ser Asp Thr Asp Thr Asp Ser 
245 250 . 255 



Gly Tyr Gly Gly Glu Ser Glu Lys Gly Asp Leu Arg Ser Glu Gin Pro 
260 265 270 



Cys Phe Lys Ser Asp His Gly Arg Arg Phe Thr Met Gly Glu Arg He 
275 280 285 
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Gly Ala lie Lys Gin Glu Ser Glu Glu Pro Pro Thr Lys Lys Asn Arg 
290 295 300 



Met Gin Leu Ser Asp Asp Glu Gly His Phe Thr Ser Ser Asp Leu He 
305 310 315 320 



Ser Ser Pro Phe Leu Gly Pro His Pro His Gin Pro Pro Phe Cys Leu 
325 330 335 



Pro Phe Tyr Leu He Pro Pro Ser Ala Thr Ala Tyr Leu Pro Met Leu 
340 345 350 



Glu Lys Cys Trp Tyr Pro Thr Ser Val Pro Val Leu Tyr Pro Gly Leu 
355 360 365 



Asn Ala Ser Ala Ala Ala Leu Ser Ser Phe Met Asn Pro Asp Lys He 
370 375 380 



Ser Ala Pro Leu Leu Met Pro Gin Arg Leu Pro Ser Pro Leu Pro Ala 
385 390 395 400 



His Pro Ser Val Asp Ser Ser Val Leu Leu Gin Ala Leu Lys Pro He 
405 410 415 



Pro Pro Leu Asn Leu Glu Thr Lys Asp 
420 425 



<210> 161 

<211> 64 

<212> PRT 

<213> Homo sapien 

<400> 161 

His Val Leu Glu Leu Leu Pro Gly Gin Leu Glu Gin Asp Asp Ser Gly 
15 10 15 



Pro Gly Val Thr Ser Gly Gin Cys Ala Gly Val Lys Asp Leu Thr Gly 
20 25 30 



Leu Arg Arg Asp Leu Arg Phe Arg Pro Gly Ser Gly Ala Val Lys Leu 
35 40 45 



Pro Val Glu Leu Ala Leu Ala Phe Arg Asn Ser Ser Ser Phe Cys Arg 
50 55 60 



<210> 162 
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<211> 111 

<212> PRT 

<213> Homo sapien 

<400> 162 

Asn Phe Lys Gin Ala Val Ser Thr Gly Leu Asn Ser Pro His Pro His 
15 10 15 



Gin Pro Pro Phe Cys Leu Pro Phe Tyr Leu lie Pro Pro Ser Ala Thr 
20 25 30 



Ala Tyr Leu Pro Met Leu Glu Lys Cys Trp Tyr Pro Thr Ser Val Pro 
35 40 45 



Val Leu Tyr Pro Gly Leu Asn Ala Ser Ala Ala Ala Leu Ser Ser Phe 
50 55 60 



Met Asn Pro Asp Lys lie Ser Ala Pro Leu Leu Met Pro Gin Arg Leu 
65 70 75 80 



Pro Ser Pro Leu Pro Ala His Pro Ser Val Asp Ser Ser Val Leu Leu 
85 90 95 



Gin Ala Leu Lys Pro lie Pro Pro Leu Asn Leu Glu Thr Lys Asp 
100 105 110 



<210> 163 

<211> 145 

<212> PRT 

<213> Homo sapien 

<400> 163 

Met Gly Lys Ser Arg Cys Pro Glu Gly Phe Pro lie Ala Glu Val Phe 
15 10 15 



Thr Leu Lys Pro Leu Glu Phe Gly Lys Pro Asn Thr Leu Val Cys Phe 
20 25 30 



Val Ser Asn Leu Phe Pro Pro Met Leu Thr Val Asn Trp Gin His His 
35 40 45 



Ser Val Pro Val Glu Gly Phe Gly Pro Thr Phe Val Ser Ala Val Asp 
50 55 60 



Gly Leu Ser Phe Gin Ala Phe Ser Tyr Leu Asn Phe Thr Pro Glu Pro 
65 70 75 80 
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Ser Asp lie Phe Ser Cys He Val Thr His Glu He Asp Arg Tyr Thr 
85 90 95 



Ala He Ala Tyr Trp Val Pro Arg Asn Ala Leu Pro Ser Asp Leu Leu 
100 105 110 



Glu Asn Val Leu Cys Gly Val Ala Phe Gly Leu Gly Val Leu Gly He 
115 120 125 



He Val Gly He Val Leu He He Tyr Phe Arg Lys Pro Cys Ser Gly 
130 135 140 



Asp 
145 



<210> 164 

<211> 270 

<212> PRT 

<213> Homo sapien 

c400> 164 

Leu Leu Pro Thr Val Trp Gin Glu Gly Met Gly His Glu Gin Asn Gin 
15 10 15 



Gly Ala Ala Leu Leu Gin Met Leu Pro Leu Leu Trp Leu Leu Pro His 
20 25 30 



Ser Trp Ala Val Pro Glu Ala Pro Thr Pro Met Trp Pro Asp Asp Leu 
35 40 45 



Gin Asn His Thr Phe Leu His Thr Val Tyr Cys Gin Asp Gly Ser Pro 
50 55 60 



Ser Val Gly Leu Ser Glu Ala Tyr Asp Glu Asp Gin Leu Phe Phe Phe 
65 70 75 80 



Asp Phe Ser Gin Asn Thr Arg Val Pro Arg Leu Pro Glu Phe Ala Asp 
85 90 95 



Trp Ala Gin Glu Gin Gly Asp Ala Pro Ala He Leu Phe Asp Lys Glu 
100 105 110 



Phe Cys Glu Trp Met He Gin Gin He Gly Pro Lys Leu Asp Gly Lys 
115 120 125 



He Pro Val Ser Arg Gly Phe Pro He Ala Glu Val Phe Thr Leu Lys 
130 135 140 
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Pro Leu Glu Phe Gly Lys Pro Asn Thr Leu Val Cys Phe Val Ser Asn 
145 150 155 160 



Leu Phe Pro Pro Met Leu Thr Val Asn Trp Gin His His Ser Val Pro 
165 170 175 



Val Glu Gly Phe Gly Pro Thr Phe Val Ser Ala Val Asp Gly Leu Ser 
180 185 190 



Phe Gin Ala Phe Ser Tyr Leu Asn Phe Thr Pro Glu Pro Ser Asp lie 
195 200 205 



Phe Ser Cys lie Val Thr His Glu He Asp Arg Tyr Thr Ala He Ala 
210 215 220 



Tyr Trp Val Pro Arg Asn Ala Leu Pro Ser Asp Leu Leu Glu Asn Val 
225 230 235 240 



Leu Cys Gly Val Ala Phe Gly Leu Gly Val Leu Gly He lie Val Gly 
245 250 255 



He Val Leu He He Tyr Phe Arg Lys Pro Cys Ser Gly Asp 
260 265 270 



<210> 165 

<211> 180 

<212> PRT 

<213> Homo sapien 

<400> 165 

His Ser Gly Leu Phe Leu Cys Leu Phe Val Ala Glu Leu Glu Pro Ala 
IS 10 15 



He Leu Phe Asp Lys Glu Phe Cys Glu Trp Met He Gin Gin He Gly 
20 25 30 



Pro Lys Leu Asp Gly Lys He Pro Val Ser Arg Gly Phe Pro He Ala 
35 40 45 



Glu Val Phe Thr Leu Lys Pro Leu Glu Phe Gly Lys Pro Asn Thr Leu 
50 55 60 



Val Cys Phe Val Ser Asn Leu Phe Pro Pro Met Leu Thr Val Asn Trp 
65 70 75 80 
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Gin His His Ser Val Pro Val Glu Gly Phe Gly Pro Thr Phe Val Ser 
85 90 95 



Ala Val Asp Gly Leu Ser Phe Gin Ala Phe Ser Tyr Leu Asn Phe Thr 
100 ICS 110 



Pro Glu Pro Ser Asp lie Phe Ser Cys He Val Thr His Glu He Asp 
115 120 125 



Arg Tyr Thr Ala He Ala Tyr Trp Val Pro Arg Asn Ala Leu Pro Ser 
130 135 140 



Asp Leu Leu Glu Asn Val Leu Cys Gly Val Ala Phe Gly Leu Gly Val 
145 150 155 160 



Leu Gly He He Val Gly He Val Leu He He Tyr Phe Arg Lys Pro 
165 170 175 



Cys Ser Gly Asp 
180 



<210> 166 

<211> 796 

<212> PRT 

<213> Homo sapien 

<400> 166 

Met Ser Leu Asp Asp Asn Leu Ser Gly Thr Ser Gly Met Glu Val Asp 
15 10 15 



Asp Arg Val Ser Ala Leu Glu Gin Arg Leu Gin Leu Gin Glu Asp Glu 
20 25 30 



Leu Ala Val Leu Lys Ala Ala Leu Ala Asp Ala Leu Arg Arg Leu Arg 
35 40 45 



Ala Cys Glu Glu Gin Gly Ala Ala Leu Arg Ala Arg Gly Thr Pro Lys 
50 55 60 



Gly Arg Ala Pro Pro Arg Leu Gly Thr Thr Ala Ser Val Cys Gin Leu 
65 70 75 80 



Leu Lys Gly Leu Pro Thr Arg Thr Pro Leu Asn Gly Ser Gly Pro Pro 
85 90 95 



Arg Arg Val Gly Gly Tyr Ala Thr Ser Pro Ser Ser Pro Lys Lys Glu 
100 105 HO 
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Ala Thr Ser Gly Arg Ser Ser Val Arg Arg Tyr Leu Ser Pro Glu Arg 
115 120 125 



Leu Ala Ser Val Arg Arg Glu Asp Pro Arg Ser Arg Thr Thr Ser Ser 
130 135 140 



Ser Ser Asn Cys Ser Ala Lys Lys Glu Gly Lys Thr Lys Glu Val He 
145 150 155 160 



Phe Ser Val Glu Asp Gly Ser Val Lys Met Phe Leu Arg Gly Arg Pro 
165 170 175 



Val Pro Met Met He Pro Asp Glu Leu Ala Pro Thr Tyr Ser Leu Asp 
180 185 190 



Thr Arg Ser Glu Leu Pro Ser Cys Arg Leu Lys Leu Glu Trp Val Tyr 
195 200 205 



Gly Tyr Arg Gly Arg Asp Cys Arg Ala Asn Leu Tyr Leu Leu Pro Thr 
210 215 220 



Gly Glu He Val Tyr Phe Val Ala Ser Val Ala Val Leu Tyr Ser Val 
225 230 235 240 



Glu Glu Gin Arg Gin Arg His Tyr Leu Gly His Asn Asp Asp He Lys 
245 250 255 



Cys Leu Ala He His Pro Asp Met Val Thr He Ala Thr Gly Gin Val 
260 265 270 



Ala Gly Thr Thr Lys Glu Gly Lys Pro Leu Pro Pro His Val Arg He 
275 280 285 



Trp Asp Ser Val Ser Leu Ser Thr Leu His Val Leu Gly Leu Gly Val 
290 295 300 



Phe Asp Arg Ala Val Cys Cys Val Gly Phe Ser Lys Ser Asn Gly Gly 
305 310 315 320 



Asn Leu Leu Cys Ala Val Asp Glu Ser Asn Asp His Met Leu Ser val 
325 330 335 



Trp Asp Trp Ala Lys Glu Thr Lys Val Val Asp Val Lys Cys Ser Asn 
340 345 350 
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Glu Ala Val Leu Val Ala Thr Phe His Pro Thr Asp Pro Thr Val Leu 
355 360 365 



lie Thr Cys Gly Lys Ser His He Tyr Phe Trp Thr Leu Glu Gly Gly 
370 375 380 



Ser Leu Ser Lys Arg Gin Gly Leu Phe Glu Lys His Glu Lys Pro Lys 
385 390 395 400 



Tyr Val Leu Cys Val Thr Phe Leu Glu Gly Gly Asp Val Val Thr Gly 
405 410 415 



Asp Ser Gly Gly Asn Leu Tyr Val Trp Gly Lys Gly Gly Asn Arg He 
420 425 430 



Thr Gin Ala Val Leu Gly Ala His Asp Gly Gly Val Phe Gly Leu Cys 
435 440 445 



Ala Leu Arg Asp Gly Thr Leu Val Ser Gly Gly Gly Arg Asp Arg Arg 
450 455 460 



Val Val Leu Trp Gly Ser Asp Tyr Ser Lys Leu Gin Glu Val Glu Val 
465 470 475 480 



Pro Glu Asp Phe Gly Pro Val Arg Thr Val Ala Glu Gly His Gly Asp 
485 490 495 



Thr Leu Tyr Val Gly Thr Thr Arg Asn Ser He Leu Gin Gly Ser Val 
500 505 510 



His Thr Gly Phe Ser Leu Leu Val Gin Gly His Val Glu Glu Leu Trp 
515 520 525 



Gly Leu Ala Thr His Pro Ser Arg Ala Gin Phe Val Thr Cys Gly Gin 
530 535 540 



Asp Lys Leu Val His Leu Trp Ser Ser Asp Ser His Gin Pro Leu Trp 
545 550 555 560 



Ser Arg He He Glu Asp Pro Ala Arg Ser Ala Gly Phe His Pro Ser 
565 570 575 



Gly Ser Val Leu Ala Val Gly Thr Val Thr Gly Arg Trp Leu Leu Leu 
560 585 590 
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Asp Thr Glu Thr His Asp Leu Val Ala He His Thr Asp Gly Asn Glu 
595 600 605 



Gin He Ser Val Val Ser Phe Ser Pro Asp Gly Ala Tyr Leu Ala Val 
610 615 620 



Gly Ser His Asp Asn Leu Val Tyr Val Tyr Thr Val Asp Gin Gly Gly 
625 630 635 640 



Arg Lys Val Ser Arg Leu Gly Lys Cys Ser Gly His Ser Ser Phe He 
645 650 655 



Thr His Leu Asp Trp Ala Gin Asp Ser Ser Cys Phe Val Thr Asn Ser 
660 665 670 



Gly Asp Tyr Glu He Leu Tyr Trp Asp Pro Ala Thr Cys Lys Gin He 
675 680 685 



Thr Ser Ala Asp Ala Val Arg Asn Met Glu Trp Ala Thr Ala Thr Cys 
690 695 700 



Val Leu Gly Phe Gly Val Phe Gly lie Trp Ser Glu Gly Ala Asp Gly 
705 710 715 720 



Thr Asp He Asn Ala Val Ala Arg Ser His Asp Gly Lys Leu Leu Ala 
725 730 735 



Ser Ala Asp Asp Phe Gly Lys Val His Leu Phe Ser Tyr Pro Cys Cys 
740 745 750 



Gin Pro Arg Ala Leu Ser His Lys Tyr Gly Gly His Ser Ser His Val 
755 760 765 



Thr Asn Val Ala Phe Leu Trp Asp Asp Ser Met Ala Leu Thr Thr Gly 
770 775 780 



Gly Lys Asp Thr Ser Val Leu Gin Trp Arg Val Val 
790 795 



785 
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Met Phe Leu Arg Gly Arg Pro Val Pro Met Met lie Pro Asp Glu Leu 
15 10 15 
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Ala Pro Thr Tyr Ser Leu Asp Thr Arg Ser Glu Leu Pro Ser Cys Arg 
20 25 30 



Leu Lys Leu Glu Trp Val Tyr Gly Tyr Arg Gly Arg Asp Cys Arg Ala 
35 40 45 



Asn Leu Tyr Leu Leu Pro Thr Gly Glu lie Val Tyr Phe Val Ala Ser 
50 55 60 



Val Ala Val Leu Tyr Ser Val Glu Glu Gin Arg Gin Arg His Tyr Leu 
65 70 75 80 



Gly His Asn Asp Asp lie Lys Cys Leu Ala lie His Pro Asp Met Val 
85 90 95 



Thr lie Ala Thr Gly Gin Val Ala Gly Thr Thr Lys Glu Gly Lys Pro 
100 105 110 



Leu Pro Pro His Val Arg lie Trp Asp Ser Val Ser Leu Ser Thr Leu 
115 120 125 



His Val Leu Gly Leu Gly Val Phe Asp Arg Ala Val Cys Cys Val Gly 
130 135 140 



Phe Ser Lys Ser Asn Gly Gly Asn Leu Leu Cys Ala Val Asp Glu Ser 
145 150 155 160 



Asn Asp His Met Leu Ser Val Trp Asp Trp Ala Lys Glu Thr Lys Val 
165 170 175 



Val Asp Val Lys Cys Ser Asn Glu Ala Val Leu Val Ala Thr Phe His 
180 185 190 



Pro Thr Asp Pro Thr Val Leu lie Thr Cys Gly Lys Ser His lie Tyr 
195 200 205 



Phe Trp Thr Leu Glu Gly Gly Ser Leu Ser Lys Arg Gin Gly Leu Phe 
210 215 220 



Glu Lys His Glu Lys Pro Lys Tyr Val Leu Cys Val Thr Phe Leu Glu 
225 230 235 240 



Gly Gly Asp Val Val Thr Gly Asp Ser Gly Gly Asn Leu Tyr Val Trp 
245 250 255 
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Gly Lys Gly Gly Asn Arg He Thr Gin Ala Val Leu Gly Ala His Asp 
260 265 270 



Gly Gly Val Phe Gly Leu Cys Ala Leu Arg Asp Gly Thr Leu Val Ser 
275 280 285 



Gly Gly Gly Arg Asp Arg Arg Val Val Leu Trp Gly Ser Asp Tyr Ser 
290 295 300 



Lys Leu Gin Glu Val Glu Val Pro Glu Asp Phe Gly Pro Val Arg Thr 
305 310 315 320 



Val Ala Glu Gly His Gly Asp Thr Leu Tyr Val Gly Thr Thr Arg Asn 
325 330 335 



Ser He Leu Gin Gly Ser Val His Thr Gly Phe Ser Leu Leu Val Gin 
340 345 350 



Gly His Val Glu Glu Leu Trp Gly Leu Ala Thr His Pro Ser Arg Ala 
355 360 365 



Gin Phe Val Thr Cys Gly Gin Asp Lys Leu Val His Leu Trp Ser Ser 
370 375 380 



Asp Ser His Gin Pro Leu Trp Ser Arg He He Glu Asp Pro Ala Arg 
385 390 395 400 



Ser Ala Gly Phe His Pro Ser Gly Ser Val Leu Ala Val Gly Thr Val 
405 410 415 



Thr Gly Arg Trp Leu Leu Leu Asp Thr Glu Thr His Asp Leu Val Ala 
420 425 430 



He His Thr Asp Gly Asn Glu Gin He Ser Val Val Ser Phe Ser Pro 
435 440 445 



Asp Gly Ala Tyr Leu Ala Val Gly Ser His Asp Asn Leu Val Tyr Val 
450 455 460 



Tyr Thr Val Asp Gin Gly Gly Arg Lys Val Ser Arg Leu Gly Lys Cys 
465 470 475 480 



Ser Gly His Ser Ser Phe He Thr His Leu Asp Trp Ala Gin Asp Ser 
485 490 495 
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Ser Cys Phe Val Thr Asn Ser Gly Asp Tyr Glu He Leu Tyr Trp Asp 
500 505 510 



Pro Ala Thr Cys Lys Gin He Thr Ser Ala Asp Ala Val Arg Asn Met 
515 520 525 



Glu Trp Ala Thr Ala Thr Cys Val Leu Gly Phe Gly Val Phe Gly He 
530 535 540 



Trp Ser Glu Gly Ala Asp Gly Thr Asp He Asn Ala Val Ala Arg Ser 
545 550 555 560 



His Asp Gly Lys Leu Leu Ala Ser Ala Asp Asp Phe Gly Lys Val His 
565 570 575 



Leu Phe Ser Tyr Pro Cys Cys Gin Pro Arg Ala Leu Ser His Lys Tyr 
580 585 590 



Gly Gly His Ser Ser His Val Thr Asn Val Ala Phe Leu Trp Asp Asp 
595 600 605 



Ser Met Ala Leu Thr Thr Gly Gly Lys Asp Thr Ser Val Leu Gin Trp 
610 615 620 



Arg Val Val 



625 
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Met Phe Leu Arg Gly Arg Pro Val Pro Met Met He Pro Asp Glu Leu 
15 10 15 



Ala Pro Thr Tyr Ser Leu Asp Thr Arg Ser Glu Leu Pro Ser Cys Arg 
20 25 30 



Leu Lys Leu Glu Trp Val Tyr Gly Tyr Arg Gly Arg Asp Cys Arg Ala 
35 40 45 



Asn Leu Tyr Leu Leu Pro Thr Gly Glu He Val Tyr Phe Val Ala Ser 
50 55 60 



Val Ala Val Leu Tyr Ser Val Glu Glu Gin Arg Gin Arg His Tyr Leu 
65 70 75 80 
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Gly His Asn Asp Asp lie Lys Cys Leu Ala lie His Pro Asp Met Val 
85 90 95 



Thr He Ala Thr Gly Gin Val Ala Gly Thr Thr Lys Glu Gly Lys Pro 
ICQ 105 110 



Leu Pro Pro His Val Arg He Trp Asp Ser Val Ser Leu Ser Thr Leu 
115 120 125 



His Val Leu Gly Leu Gly Val Phe Asp Arg Ala Val Cys Cys Val Gly 
130 135 140 



Phe Ser Lys Ser Asn Gly Gly Asn Leu Leu Cys Ala Val Asp Glu Ser 
145 150 155 160 



Asn Asp His Met Leu Ser Val Trp Asp Trp Ala Lys Glu Thr Lys Val 
165 170 175 



Val Asp Val Lys Cys Ser Asn Glu Ala Val Leu Val Ala Thr Phe His 
180 185 190 



Pro Thr Asp Pro Thr Val Leu He Thr Cys Gly Lys Ser His He Tyr 
195 200 205 



Phe Trp Thr Leu Glu Gly Gly Ser Leu Ser Lys Arg Gin Gly Leu Phe 
210 215 220 



Glu Lys His Glu Lys Pro Lys Tyr Val Leu Cys Val Thr Phe Leu Glu 
225 230 235 240 



Gly Gly Asp Val Val Thr Gly Asp Ser Gly Gly Asn Leu Tyr Val Trp 
245 250 255 



Gly Lys Gly Gly Asn Arg He Thr Gin Ala Val Leu Gly Ala His Asp 
260 265 270 



Gly Gly Val Phe Gly Leu Cys Ala Leu Arg Asp Gly Thr Leu Val Ser 
275 280 285 



Gly Gly Gly Arg Asp Arg Arg Val Val Leu Trp Gly Ser Asp Tyr Ser 
290 295 300 



Lys Leu Gin Glu Val Glu Val Pro Glu Asp Phe Gly Pro Val Arg Thr 
305 310 315 320 
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Val Ala Glu Gly His Gly Asp Thr Leu Tyr Val Gly Thr Thr Arg Asn 
325 330 335 



Ser He Leu Gin Gly Ser Val His Thr Gly Phe Ser Leu Leu Val Gin 
340 345 350 



Gly His Val Glu Glu Leu Trp Gly Leu Ala Thr His Pro Ser Arg Ala 
355 360 365 



Gin Phe Val Thr Cys Gly Gin Asp Lys Leu Val His Leu Trp Ser Ser 
370 375 380 



Asp Ser His Gin Pro Leu Trp Ser Arg He He Glu Asp Pro Ala Arg 
385 390 395 400 



Ser Ala Gly Phe His Pro Ser Gly Ser Val Leu Ala Val Gly Thr Val 
405 410 415 



Thr Gly Arg Trp Leu Leu Leu Asp Thr Glu Thr His Asp Leu Val Ala 
420 425 430 



He His Thr Asp Gly Asn Glu Gin He Ser Val Val Ser Phe Ser Pro 
435 440 445 



Asp Gly Ala Tyr Leu Ala Val Gly Ser His Asp Asn Leu Val Tyr Val 
450 455 460 



Tyr Thr Val Asp Gin Gly Gly Arg Lys Val Ser Arg Leu Gly Lys Cys 
465 470 475 480 



Ser Gly His Ser Ser Phe He Thr His Leu Asp Trp Ala Gin Asp Ser 
485 490 495 



Ser Cys Phe Val Thr Asn Ser Gly Asp Tyr Glu He Leu Tyr Trp Asp 
500 505 510 



Pro Ala Thr Cys Lys Gin He Thr Ser Ala Asp Ala Val Arg Asn Met 
515 520 525 



Glu Trp Ala Thr Ala Thr Cys Val Leu Gly Phe Gly Val Phe Gly He 
530 535 540 



Trp Ser Glu Gly Ala Asp Gly Thr Asp He Asn Ala Val Ala Arg Ser 
545 550 555 560 
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His Asp Gly Lys Leu Leu Ala Ser Ala Asp Asp Phe Gly Lys Val His 
565 570 575 



Leu Phe Ser Tyr Pro Cys Cys Gin Pro Arg Ala Leu Ser His Lys Tyr 
580 585 590 



Gly Gly His Ser Ser His Val Thr Asn Val Ala Phe Leu Trp Asp Asp 
595 600 605 



Ser Met Ala Leu Thr Thr Gly Gly Lys Asp Thr Ser Val Leu Gin Trp 
610 615 620 



Arg Val Val 
625 
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Met Leu Glu Arg Arg Ala Leu Leu Trp Gin Arg Glu Ala Gly Pro Gly 
15 10 15 



Trp Gly Asp Arg Ala Arg Ala Gly Thr Gly Gly Ala Gly Gly Gly Cys 
20 25 30 



Gly Gly Ala Met Ala Glu Arg Gly Pro Ala Phe Cys Gly Leu Tyr Asp 
35 40 45 



Thr Ser Ser Leu Leu Arg Tyr Cys Asn Asp Asp Asn Leu Ser Gly Thr 
50 55 60 



Ser Gly Met Glu Val Asp Asp Arg Val Ser Ala Leu Glu Gin Arg Leu 
65 70 75 80 



Gin Leu Gin Glu Asp Glu Leu Ala Val Leu Lys Ala Ala Leu Ala Asp 
85 90 95 



Ala Leu Arg Arg Leu Arg Ala Cys Glu Glu Gin Gly Ala Ala Leu Arg 
100 105 110 



Ala Arg Gly Thr Pro Lys Gly Arg Ala Pro Pro Arg Leu Gly Thr Thr 
115 120 125 



Ala Ser Val Cys Gin Leu Leu Lys Gly Leu Pro Thr Arg Thr Pro Leu 
130 135 140 
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Asn Gly Ser Gly Pro Pro Arg Arg Val Gly Gly Tyr Ala Thr Ser Pro 
145 150 155 160 



Ser Ser Pro Lys Lys Glu Ala Thr Ser Gly Arg Ser Ser Val Arg Arg 
165 170 175 



Tyr Leu Ser Pro Glu Arg Leu Ala Ser Val Arg Arg Glu Asp Pro Arg 
180 185 190 



Ser Arg Thr Thr Ser Ser Ser Ser Asn Cys Ser Ala Lys Lys Glu Gly 
195 200 205 



Lys Thr Lys Glu Val lie Phe Ser Val Glu Asp Gly Ser Val Lys Met 
210 215 220 



Phe Leu Arg Gly Arg Pro Val Pro Met Met lie Pro Asp Glu Leu Ala 
225 230 235 240 



Pro Thr Tyr Ser Leu Asp Thr Arg Ser Glu Leu Pro Ser Cys Arg Leu 
245 250 255 



Lys Leu Glu Trp Val Tyr Gly Tyr Arg Gly Arg Asp Cya Arg Ala Asn 
260 265 270 



Leu Tyr Leu Leu Pro Thr Gly Glu He Val Tyr Phe Val Ala Ser Val 
275 280 285 



Ala Val Leu Tyr Ser Val Glu Glu Gin Arg Gin Arg His Tyr Leu Gly 
290 295 300 



His Asn Asp Asp He Lys Cys Leu Ala He His Pro Asp Met Val Thr 
305 310 315 320 



lie Ala Thr Gly Gin Val Ala Gly Thr Thr Lys Glu Gly Lys Pro Leu 
325 330 335 



Pro Pro His Val Arg He Trp Asp Ser Val Ser Leu Ser Thr Leu His 
340 345 350 



Val Leu Gly Leu Gly Val Phe Asp Arg Ala Val Cys Cys Val Gly Phe 
355 360 365 



Ser Lys Ser Asn Gly Gly Asn Leu Leu Cys Ala Val Asp Glu Ser Asn 
370 375 380 
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Asp His Met Leu Ser Val Trp Asp Trp Ala Lys Glu Thr Lys Val Val 
385 390 395 400 



Asp Val Lys Cys Ser Asn Glu Ala Val Leu Val Ala Thr Phe His Pro 
405 410 415 



Thr Asp Pro Thr Val Leu lie Thr Cys Gly Lys Ser His lie Tyr Phe 
420 425 430 



Trp Thr Leu Glu Gly Gly Ser Leu Ser Lys Arg Gin Gly Leu Phe Glu 
435 440 445 



Lys His Glu Lys Pro Lys Tyr Val Leu Cys Val Thr Phe Leu Glu Gly 
450 455 460 



Gly Asp val Val Thr Gly Asp Ser Gly Gly Asn Leu Tyr Val Trp Gly 
465 470 475 480 



Lys Gly Pro 
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Met Ser Ser Phe Gly Ala Gly Lys Thr Lys Glu Val He Phe Ser Val 
15 10 15 



Glu Asp Gly Ser Val Lys Met Phe Leu Arg Gly Arg Pro Val Pro Met 
20 25 30 



Met He Pro Asp Glu Leu Ala Pro Thr Tyr Ser Leu Asp Thr Arg Ser 
35 40 45 



Glu Leu Pro Ser Cys Arg Leu Lys Leu Glu Trp Val Tyr Gly Tyr Arg 
50 55 60 



Gly Arg Asp Cys Arg Ala Asn Leu Tyr Leu Leu Pro Thr Gly Glu He 
65 70 75 80 



Val Tyr Phe Val Ala Ser Val Ala Val Leu Tyr Ser Val Glu Glu Gin 
85 90 95 



Arg Gin Arg His Tyr Leu Gly His Asn Asp Asp He Lys Cys Leu Ala 
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100 105 110 



lie His Pro Asp Met Val Thr lie Ala Thr Gly Gin Val Ala Gly Thr 
115 120 125 



Thr Lys Glu Gly Lys Pro Leu Pro Pro His Val Arg lie Trp Asp Ser 
130 135 140 



Val Ser Leu Ser Thr Leu His Val Leu Gly Leu Gly Val Phe Asp Arg 
145 150 155 160 



Ala Val Cys Cys Val Gly Phe Ser Lys Ser Asn Gly Gly Asn Leu Leu 
165 170 175 



Cys Ala Val Asp Glu Ser Asn Asp His Met Leu Ser Val Trp Asp Trp 
180 185 190 



Ala Lys Glu Thr Lys Val Val Asp Val Lys Cys Ser Asn Glu Ala Val 
195 200 205 



Leu Val Ala Thr Phe His Pro Thr Asp Pro Thr Val Leu lie Thr Cys 
210 215 220 



Gly Lys Ser His lie Tyr Phe Trp Thr Leu Glu Gly Gly Ser Leu Ser 
225 230 235 240 



Lys Arg Gin Gly Leu Phe Glu Lys His Glu Lys Pro Lys Tyr Val Leu 
245 250 255 



Cys Val Thr Phe Leu Glu Gly Gly Asp Val Val Thr Gly Asp Ser Gly 
260 265 270 



Gly Asn Leu Tyr Val Trp Gly Lys Gly Gly Asn Arg lie Thr Gin Ala 
275 280 285 



Val Leu Gly Ala His Asp Gly Gly Val Phe Gly Leu Cys Ala Leu Arg 
290 295 300 



Asp Gly Thr Leu Val Ser Gly Gly Gly Arg Asp Arg Arg Val Val Leu 
305 310 315 320 



Trp Gly Ser Asp Tyr Ser Lys Leu Gin Glu Val Glu Val Pro Glu Asp 
325 330 335 



Phe Gly Pro Val Arg Thr Val Ala Glu Gly His Gly Asp Thr Leu Tyr 
340 345 350 
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Val Gly Thr Thr Arg Asn Ser He Leu Gin Gly Ser Val His Thr Gly 
355 360 365 



Phe Ser Leu Leu Val Gin Asp Pro Ala Arg Ser Ala Gly Phe His Pro 
370 375 380 



Ser Gly Ser Val Leu Ala Val Gly Thr Val Thr Gly Arg Trp Leu Leu 
385 390 395 400 



Leu Asp Thr Glu Thr His Asp Leu Val Ala He His Thr Asp Gly Asn 
405 410 415 



Glu Gin He Ser Val Val Ser Phe Ser Pro Asp Gly Ala Tyr Leu Ala 
420 425 430 



Val Gly Ser His Asp Asn Leu Val Tyr Val Tyr Thr Val Asp Gin Gly 
435 440 445 



Gly Arg Lys Val Ser Arg Leu Gly Lys Cys Ser Gly His Ser Ser Phe 
450 455 460 



He Thr His Leu Asp Trp Ala Gin Asp Ser Ser Cys Phe Val Thr Asn 
465 470 475 480 



Ser Gly Asp Tyr Glu He Leu Tyr Trp Asp Pro Ala Thr Cys Lys Gin 
485 490 495 



He Thr Ser Ala Asp Ala Val Arg Asn Met Glu Trp Ala Thr Ala Thr 
500 505 510 



Cys Val Leu Gly Phe Gly Val Phe Gly He Trp Ser Glu Gly Ala Asp 
515 520 525 



Gly Thr Asp He Asn Ala Val Ala Arg Ser His Asp Gly Lys Leu Leu 
530 535 540 



Ala Ser Ala Asp Asp Phe Gly Lys Val His Leu Phe Ser Tyr Pro Cys 
545 550 555 560 



Cys Gin Pro Arg Ala Leu Ser His Lys Tyr Gly Gly His Ser Ser His 
565 570 575 



Val Thr Asn Val Ala Phe Leu Trp Asp Asp Ser Met Ala Leu Thr Thr 
580 585 590 
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Gly Gly Lys Asp Thr Ser Val Leu Gin Trp Arg Val Val 

600 605 





595 
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Met Ser Ser Phe Gly Ala Gly Lys Thr Lys Glu Val He Phe Ser Val 
15 10 15 



Glu Asp Gly Ser Val Lys Met Phe Leu Arg Gly Arg Pro Val Pro Met 
20 25 30 



Met He Pro Asp Glu Leu Ala Pro Thr Tyr Ser Leu Asp Thr Arg Ser 
35 40 45 



Glu Leu Pro Ser Cys Arg Leu Lys Leu Glu Trp Val Tyr Gly Tyr Arg 
50 55 60 



Gly Arg Asp Cys Arg Ala Asn Leu Tyr Leu Leu Pro Thr Gly Glu He 
65 70 75 80 



Val Tyr Phe Val Ala Ser Val Ala Val Leu Tyr Ser Val Glu Glu Gin 
85 90 95 



Arg Gin Arg His Tyr Leu Gly His Asn Asp Asp He Lys Cys Leu Ala 
100 105 110 



He His Pro Asp Met Val Thr He Ala Thr Gly Gin Val Ala Gly Thr 
' 115 120 125 



Thr Lys Glu Gly Lys Pro Leu Pro Pro His Val Arg He Trp Asp Ser 
130 135 140 



Val Ser Leu Ser Thr Leu His Val Leu Gly Leu Gly Val Phe Asp Arg 
145 150 155 160 



Ala Val Cys Cys Val Gly Phe Ser Lys Ser Asn Gly Gly Asn Leu Leu 
165 170 175 



Cys Ala Val Asp Glu Ser Asn Asp His Met Leu Ser Val Trp Asp Trp 
180 185 190 



Ala Lys Glu Thr Lys Val Val Asp Val Lys Cys Ser Asn Glu Ala Val 
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195 



200 



205 



Leu Val Ala Thr Phe His Pro Thr Asp Pro Thr Val Leu lie Thr Cys 
210 215 220 



Gly Lys Ser His lie Tyr Phe Trp Thr Leu Glu Gly Gly Ser Leu Ser 
225 230 235 240 



Lys Arg Gin Gly Leu Phe Glu Lys His Glu Lys Pro Lys Tyr Val Leu 
245 250 255 



Cys Val Thr Phe Leu Glu Gly Gly Asp Val Val Thr Gly Asp Ser Gly 
260 265 270 



Gly Asn Leu Tyr Val Trp Gly Lys Gly Gly Asn Arg He Thr Gin Ala 
275 280 285 



Val Leu Gly Ala His Asp Gly Gly Val Phe Gly Leu Cys Ala Leu Arg 
290 295 300 



Asp Gly Thr Leu Val Ser Gly Gly Gly Arg Asp Arg Arg Val Val Leu 
305 310 315 320 



Trp Gly Ser Asp Tyr Ser Lys Leu Gin Glu Val Glu Val Pro Glu Asp 
325 330 335 



Phe Gly Pro Val Arg Thr Val Ala Glu Gly His Gly Asp Thr Leu Tyr 
340 345 350 



Val Gly Thr Thr Arg Asn Ser He Leu Gin Gly Ser Val His Thr Gly 
355 360 365 



Phe Ser Leu Leu Val Gin Gly His Val Glu Glu Leu Trp Gly Leu Ala 
370 375 380 



Thr His Pro Ser Arg Ala Gin Phe Val Thr Cys Gly Gin Asp Lys Leu 
385 390 395 400 



Val His Leu Trp Ser Ser Asp Ser His Gin Pro Leu Trp Ser Arg He 
405 410 415 



He Glu Asp Pro Ala Arg Ser Ala Gly Phe His Pro Ser Gly Ser Val 
420 425 430 



Leu Ala Val Gly Thr Val Thr Gly Arg Trp Leu Leu Leu Asp Thr Glu 
435 440 445 
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Thr His Asp Leu Val Ala lie His Thr Asp Gly Asn Glu Gin lie Ser 
450 455 460 



Val Val Ser Phe Ser Pro Gly Pro Phe Gin Phe Tyr His Pro Pro Gly 
465 470 475 480 



Leu Gly Pro Gly Gin Gin Leu Leu Cys His Gin Leu Arg Gly Leu 
485 490 495 
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lie Gly Arg Gly Arg Pro Gly Gin Val Ala Gly Thr Thr Lys Glu Gly 
15 10 15 



Lys Pro Leu Pro Pro His Val Arg lie Trp Asp Ser Val Ser Leu Ser 
20 25 30 



Thr Leu His Val Leu Gly Leu Gly Val Phe Asp Arg Ala Val Cys Cys 
35 40 45 



Val Gly Phe Ser Lys Ser Cys Ser Asn Glu Ala Val Leu Val Ala Thr 
50 55 60 



Phe His Pro Thr Asp Pro Thr Val Leu lie Thr Cys Gly Lys Ser His 
65 70 75 80 



lie Tyr Phe Trp Thr Leu Glu Gly Gly Ser Leu Ser Lys Arg Gin Gly 
85 90 95 



Leu Phe Glu Lys His Glu Lys Pro Lys Tyr Val Leu Cys Val Thr Phe 
100 105 110 



Leu Glu Gly Gly Asp Val Val Thr Gly Asp Ser Gly Gly Asn Leu Tyr 
115 120 125 



Val Trp Gly Lys Gly Gly Asn Arg lie Thr Gin Ala Val Leu Gly Ala 
130 135 140 



His Asp Gly Gly Val Phe Gly Leu Cys Ala Leu Arg Asp Gly Thr Leu 
145 150 155 160 
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Val Ser Gly Gly Gly Arg Asp Arg Arg Val Val Leu Trp Gly Ser Asp 
165 170 175 



Tyr Ser Lys Leu Gin Glu Val Glu Val Pro Glu Asp Phe Gly Pro Val 
180 185 190 



Arg Thr Val Ala Glu Gly His Gly Asp Thr Leu Tyr Val Gly Thr Thr 
195 200 205 



Arg Asn Ser lie Leu Gin Gly Ser Val His Thr Gly Phe Ser Leu Leu 
210 . 215 220 



Val Gin Asp Pro Ala Thr Lys Ser Leu Thr Pro Ser Thr Ala Glu Gly 
225 230 235 240 



Pro Gin Ala Pro Ala Pro Thr Val Leu Pro Pro Ala Thr Leu lie Gly 
245 250 255 



Gly Gly Thr Leu Gin Gly His Val Glu Glu Leu Trp Gly Leu Ala Thr 
260 265 270 



His Pro Ser Arg Ala Gin Phe Val Thr Cys Gly Gin Asp Lys Leu Val 
275 280 285 



His Leu Trp Ser Ser Asp Ser His Gin Pro Leu Trp Ser Arg lie He 
290 295 300 



Glu Asp Pro Ala Arg Ser Ala Gly Phe His Pro Ser Gly Ser Val Leu 
305 310 315 320 



Ala Val Gly Thr Val Thr Gly Arg Trp Leu Leu Leu Asp Thr Glu Thr 
325 330 335 



His Asp Leu Val Ala He His Thr Asp Gly Asn Glu Gin He Ser Val 
340 345 350 



Val Ser Phe Ser Pro Asp Gly Ala Tyr Leu Ala Val Gly Ser His Asp 
355 360 365 



Asn Leu Val Tyr Val Tyr Thr Val Asp Gin Gly Gly Arg Lys Val Ser 
370 375 380 



Arg Leu Gly Lys Cys Ser Gly His Ser Ser Phe He Thr His Leu Asp 
385 390 395 400 



Trp Ala Gin Asp Ser Ser Cys Phe Val Thr Asn Ser Gly Asp Tyr Glu 
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405 410 415 



lie Leu Tyr Trp Asp Pro Ala Thr Cys Lys Gin He Thr Ser Ala Asp 
420 425 430 



Ala Val Arg Asn Met Glu Trp Ala Thr Ala Thr Cys Val Leu Gly Phe 
435 440 445 



Gly Val Phe Gly He Trp Ser Glu Gly Ala Asp Gly Thr Asp He Asn 
450 455 460 



Ala Val Ala Arg Ser His Asp Gly Lys Leu Leu Ala Ser Ala Asp Asp 
465 470 475 480 



Phe Gly Lys Val His Leu Phe Ser Tyr Pro Cys Cys Gin Pro Arg Ala 
485 490 495 



Leu Ser His Lys Tyr Gly Gly His Ser Ser His Val Thr Asn Val Ala 
500 505 510 



Phe Leu Trp Asp Asp Ser Met Ala Leu Thr Thr Gly Gly Lys Asp Thr 
515 520 525 



Ser Val Leu Gin Trp Arg Val Val 
530 535 
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Arg Leu Gly Ser Gly Leu Gly Val Asn Gly Arg Gly Arg Pro Gly Gin 
15 10 15 



Val Ala Gly Thr Thr Lys Glu Gly Lys Pro Leu Pro Pro His Val Arg 
20 25 30 



He Trp Asp Ser Val Ser Leu Ser Thr Leu His Val Leu Gly Leu Gly 
35 40 45 



Val Phe Asp Arg Ala Val Cys Cys Val Gly Phe Ser Lys Ser Cys Ser 
50 55 60 



Asn Glu Ala Val Leu Val Ala Thr Phe His Pro Thr Asp Pro Thr Val 
65 70 75 80 
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Leu lie Thr Cys Gly Lys Ser His lie Tyr Phe Trp Thr Leu Glu Gly 
85 90 95 



Gly Ser Leu Ser Lys Arg Gin Gly Leu Phe Glu Lys His Glu Lys Pro 
100 105 110 



Lys Tyr Val Leu Cys Val Thr Phe Leu Glu Gly Gly Asp Val Val Thr 
115 120 125 



Gly Asp Ser Gly Gly Asn Leu Tyr Val Trp Gly Lys Gly Gly Asn Arg 
130 135 140 



He Thr Gin Ala Val Leu Gly Ala His Asp Gly Gly Val Phe Gly Leu 
145 150 155 160 



Cys Ala Leu Arg Asp Gly Thr Leu Val Ser Gly Gly Gly Arg Asp Arg 
165 170 175 



Arg Val Val Leu Trp Gly Ser Asp Tyr Ser Lys Leu Gin Glu Val Glu 
180 185 190 



Val Pro Glu Asp Phe Gly Pro Val Arg Thr Val Ala Glu Gly His Gly 
195 200 205 



Asp Thr Leu Tyr Val Gly Thr Thr Arg Asn Ser He Leu Gin Gly Ser 
210 215 220 



Val His Thr Gly Phe Ser Leu Leu Val Gin Asp Pro Ala Thr Lys Ser 
225 230 235 240 



Leu Thr Pro Ser Thr Ala Glu Gly Pro Gin Ala Pro Ala Pro Thr Val 
245 250 255 



Leu Pro Pro Ala Thr Leu He Gly Gly Gly Thr Leu Gin Gly His Val 
260 265 270 



Glu Glu Leu Trp Gly Leu Ala Thr His Pro Ser Arg Ala Gin Phe Val 
275 280 285 



Thr Cys Gly Gin Asp Lys Leu Val His Leu Trp Ser Ser Asp Ser His 
290 295 300 



Gin Pro Leu Trp Ser Arg He He Glu Asp Pro Ala Arg Ser Ala Gly 
305 310 315 320 
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Phe His Pro Ser Gly Ser Val Leu Ala Val Gly Thr Val Thr Gly Arg 
325 330 335 



Trp Leu Leu Leu Asp Thr Glu Thr His Asp Leu Val Ala lie His Thr 
340 345 350 



Asp Gly Asn Glu Gin lie Ser Val Val Ser Phe Ser Pro Asp Gly Ala 
355 360 365 



Tyr Leu Ala Val Gly Ser His Asp Asn Leu Val Tyr Val Tyr Thr Val 
370 375 380 



Asp Gin Gly Gly Arg Lys Val Ser Arg Leu Gly Lys Cys Ser Gly His 
385 390 395 400 



Ser Ser Phe He Thr His Leu Asp Trp Ala Gin Asp Ser Ser Cys Phe 
405 410 415 



Val Thr Asn Ser Gly Asp Tyr Glu He Leu Tyr Trp Asp Pro Ala Thr 
420 425 430 



Cys Lys Gin He Thr Ser Ala Asp Ala Val Arg Asn Met Glu Trp Ala 
435 440 445 



Thr Ala Thr Cys Val Leu Gly Phe Gly Val Phe Gly He Trp Ser Glu 
450 455 460 



Gly Ala Asp Gly Thr Asp He Asn Ala Val Ala Arg Ser His Asp Gly 
465 470 475 480 



Lys Leu Leu Ala Ser Ala Asp Asp Phe Gly Lys Val His Leu Phe Ser 
485 490 495 



Tyr Pro Cys Cys Gin Pro Arg Ala Leu Ser His Lys Tyr Gly Gly His 
500 505 510 



Ser Ser His Val Thr Asn Val Ala Phe Leu Trp Asp Asp Ser Met Ala 
515 520 525 



Leu Thr Thr Gly Gly Lys Asp Thr Ser Val Leu Gin Trp Arg Val Val 
530 535 540 



<210> 174 

<211> 482 

<212> PRT 

<213> Homo sapien 
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<220> 

< 2 2 1 > MI SC_FEATURE 

<222> (2) . . (2) 

<223> X=any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (6) . . (6) 

<223> X=any amino acid 



<400> 174 



Ser Xaa Gly His Cys Xaa Asp Phe He Trp Pro Gly His Trp Leu Ser 
15 10 15 



Thr Trp His Trp Ser Arg Gin Arg Pro Ser Trp Gly Lys Leu Met Phe 
20 25 30 



Thr Gly Gly Arg Asn Pro Pro Tyr Leu Gin Ala Ala Ser Gin Pro Gin 
35 40 45 



Glu Ala Thr Arg Leu Ala Glu Ser His Val Glu Ser Ala Ser Asn Met 
50 55 60 



Glu Gin Leu Thr Arg Glu Thr Glu Asp Tyr Ser Lys Gin Ala Leu Ser 
65 70 75 80 



Leu Val Arg Lys Ala Leu His Glu Gly Val Gly Ser Gly Ser Gly Ser 
85 90 95 



Pro Asp Gly Ala Val Val Gin Gly Leu Val Glu Lys Leu Glu Lys Thr 
100 105 110 



Lys Ser Leu Ala Gin Gin Leu Thr Arg Glu Ala Thr Gin Ala Glu He 
115 120 125 



Glu Ala Asp Arg Ser Tyr Gin His Ser Leu Arg Leu Leu Asp Ser Val 
130 135 140 



Ser Arg Leu Gin Gly Val Ser Asp Gin Ser Phe Gin Val Glu Glu Ala 
145 150 155 160 



Lys Arg He Lys Gin Lys Ala Asp Ser Leu Ser Ser Leu Val Thr Arg 
165 170 175 



His Met Asp Glu Phe Lys Arg Thr Gin Lys Asn Leu Gly Asn Trp Lys 
180 185 190 
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Glu Glu Ala Gin Gin Leu Leu Gin Asn Gly Lys Ser Gly Arg Glu Lys 
195 200 205 



Ser Asp Gin Leu Leu Ser Arg Ala Asn Leu Ala Lys Ser Arg Ala Gin 
210 215 220 



Glu Ala Leu Ser Met Gly Asn Ala Thr Phe Tyr Glu Val Glu Ser lie 
225 230 235 240 



Leu Lys Asn Leu Arg Glu Phe Asp Leu Gin Val Asp Asn Arg Lys Ala 
245 250 255 



Glu Ala Glu Glu Ala Met Lys Arg Leu Ser Tyr He Ser Gin Lys Val 
260 265 270 



Ser Asp Ala Ser Asp Lys Thr Gin Gin Ala Glu Arg Ala Leu Gly Ser 
275 280 285 



Ala Ala Ala Asp Ala Gin Arg Ala Lys Asn Gly Ala Gly Glu Ala Leu 
290 295 300 



Glu He Ser Ser Glu He Glu Gin Glu He Gly Ser Leu Asn Leu Glu 
305 310 315 320 



Ala Asn Val Thr Ala Asp Gly Ala Leu Ala Met Glu Lys Gly Leu Ala 
325 330 335 



Ser Leu Lys Ser Glu Met Arg Glu Val Glu Gly Glu Leu Glu Arg Lys 
340 345 350 



Glu Leu Glu Phe Asp Thr Asn Met Asp Ala Val Gin Met Val He Thr 
355 360 365 



Glu Ala Gin Lys Val Asp Thr Arg Ala Lys Asn Ala Gly Val Thr He 
370 375 380 



Gin Asp Thr Leu Asn Thr Leu Asp Gly Leu Leu His Leu Met Asp Gin 
385 390 395 400 



Pro Leu Ser Val Asp Glu Glu Gly Leu Val Leu Leu Glu Gin Lys Leu 
405 410 415 



Ser Arg Ala Lys Thr Gin He Asn Ser Gin Leu Arg Pro Met Met Ser 
420 425 430 
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Glu Leu Glu Glu Arg Ala Arg Gin Gin Arg Gly His Leu His Leu Leu 
435 440 445 



Glu Thr Ser lie Asp Gly He Leu Ala Asp Val Lys Asn Leu Glu Asn 
450 455 460 



He Arg Asp Asn Leu Pro Pro Gly Cys Tyr Asn Thr Gin Ala Leu Glu 
465 470 475 480 



Gin Gin 



<210> 175 

<211> 454 

<212> PRT 

<213> Homo sapien 

<400> 175 

Met Leu Met Phe Thr Gly Gly Arg Asn Pro Pro Tyr Leu Gin Ala Ala 
15 10 15 



Ser Gin Pro Gin Glu Ala Thr Arg Leu Ala Glu Ser His Val Glu Ser 
20 25 30 



Ala Ser Asn Met Glu Gin Leu Thr Arg Glu Thr Glu Asp Tyr Ser Lys 
35 40 45 



Gin Ala Leu Ser Leu Val Arg Lys Ala Leu His Glu Gly Val Gly Ser 
50 55 60 



Gly Ser Gly Ser Pro Asp Gly Ala Val Val Gin Gly Leu Val Glu Lys 
65 70 75 80 



Leu Glu Lys Thr Lys Ser Leu Ala Gin Gin Leu Thr Arg Glu Ala Thr 
85 90 95 



Gin Ala Glu He Glu Ala Asp Arg Ser Tyr Gin His Ser Leu Arg Leu 
100 105 110 



Leu Asp Ser Val Ser Arg Leu Gin Gly Val Ser Asp Gin Ser Phe Gin 
115 120 125 



Val Glu Glu Ala Lys Arg He Lys Gin Lys Ala Asp Ser Leu Ser Ser 
130 135 140 



Leu Val Thr Arg His Met Asp Glu Phe Lys Arg Thr Gin Lys Asn Leu 
145 150 155 160 
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Gly Asn Trp Lys Glu Glu Ala Gin Gin Leu Leu Gin Asn Gly Lys Ser 
165 170 175 



Gly Arg Glu Lys Ser Asp Gin Leu Leu Ser Arg Ala Asn Leu Ala Lys 
180 185 190 



Ser Arg Ala Gin Glu Ala Leu Ser Met Gly Asn Ala Thr Phe Tyr Glu 
195 200 205 



Val Glu Ser lie Leu Lys Asn Leu Arg Glu Phe Asp Leu Gin Val Asp 
210 215 220 



Asn Arg Lys Ala Glu Ala Glu Glu Ala Met Lys Arg Leu Ser Tyr lie 
225 230 235 240 



Ser Gin Lys Val Ser Asp Ala Ser Asp Lys Thr Gin Gin Ala Glu Arg 
245 250 255 



Ala Leu Gly Ser Ala Ala Ala Asp Ala Gin Arg Ala Lys Asn Gly Ala 
260 265 270 



Gly Glu Ala Leu Glu lie Ser Ser Glu He Glu Gin Glu He Gly Ser 
275 280 285 



Leu Asn Leu Glu Ala Asn Val Thr Ala Asp Gly Ala Leu Ala Met Glu 
290 295 300 



Lys Gly Leu Ala Ser Leu Lys Ser Glu Met Arg Glu Val Glu Gly Glu 
305 310 315 320 



Leu Glu Arg Lys Glu Leu Glu Phe Asp Thr Asn Met Asp Ala Val Gin 
325 330 335 



Met Val He Thr Glu Ala Gin Lys Val Asp Thr Arg Ala Lys Asn Ala 
340 345 350 



Gly Val Thr He Gin Asp Thr Leu Asn Thr Leu Asp Gly Leu Leu His 
355 360 365 



Leu Met Asp Gin Pro Leu Ser Val Asp Glu Glu Gly Leu Val Leu Leu 
370 375 380 



Glu Gin Lys Leu Ser Arg Ala Lys Thr Gin He Asn Ser Gin Leu Arg 
385 390 395 400 
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Pro Met Met Ser Glu Leu Glu Glu Arg Ala Arg Gin Gin Arg Gly His 
405 410 415 



Leu His Leu Leu Glu Thr Ser lie Asp Gly lie Leu Ala Asp Val Lys 
420 425 430 



Asn Leu Glu Asn lie Arg Asp Asn Leu Pro Pro Gly Cys Tyr Asn Thr 
435 440 445 



Gin Ala Leu Glu Gin Gin 
450 



<210> 176 

<211> 340 

<212> PRT 

<213> Homo sapien 

<400> 176 

Met His Asp Val Lys Asn His Arg Thr Phe Leu Lys Arg Thr Lys Tyr 
15 10 15 



Asp Asn Leu His Leu Glu Asp Leu Phe lie Gly Asn Lys Val Asn Val 
20 25 30 



Phe Ser Arg Gin Leu Val Leu He Asp Tyr Gly Asp Gin Tyr Thr Ala 
35 40 45 



Arg Gin Leu Gly Ser Arg Lys Glu Lys Thr Leu Ala Leu He Lys Pro 
50 55 60 



Asp Ala He Ser Lys Ala Gly Glu He He Glu He He Asn Lys Ala 
65 70 75 80 



Gly Phe Thr He Thr Lys Leu Lys Met Met Met Leu Ser Arg Lys Glu 
85 90 95 



Ala Leu Asp Phe His Val Asp His Gin Ser Arg Pro Phe Phe Asn Glu 
100 105 110 



Leu He Gin Phe He Thr Thr Gly Pro He He Ala Met Glu He Leu 
115 120 125 



Arg Asp Asp Ala He Cys Glu Trp Lys Arg Leu Leu Gly Pro Ala Asn 
130 135 140 



Ser Gly Val Ala Arg Thr Asp Ala Ser Glu Ser He Arg Ala Leu Phe 
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145 150 155 160 



Gly Thr Asp Gly He Arg Asn Ala Ala His Gly Pro Asp Ser Phe Ala 
165 170 175 



Ser Ala Ala Arg Glu Met Glu Leu Phe Phe Pro Ser Ser Gly Gly Cys 
180 185 190 



Gly Pro Ala Asn Thr Ala Lys Phe Thr Asn Cys Thr Cys Cys He Val 
195 200 205 



Lys Pro His Ala Val Ser Glu Gly Leu Leu Gly Lys lie Leu Met Ala 
210 215 220 



He Arg Asp Ala Gly Phe Glu He Ser Ala Met Gin Met Phe Asn Met 
225 230 235 240 



Asp Arg Val Asn Val Glu Glu Phe Tyr Glu Val Tyr Lys Gly Val Val 
245 250 255 



Thr Glu Tyr His Asp Met Val Thr Glu Met Tyr Ser Gly Pro Cys Val 
260 265 270 



Ala Met Glu He Gin Gin Asn Asn Ala Thr Lys Thr Phe Arg Glu Phe 
275 280 285 



Cys Gly Pro Ala Asp Pro Glu He Ala Arg His Leu Arg Pro Gly Thr 
290 295 300 



Leu Arg Ala He Phe Gly Lys Thr Lys He Gin Asn Ala Val His Cys 
305 310 315 320 



Thr Asp Leu Pro Glu Asp Gly Leu Leu Glu Val Gin Tyr Phe Phe Lys 
325 330 335 



He Leu Asp Asn 
340 



<210> 177 

<211> 304 

<212> PRT 

<213> Homo sapien 

<220> 

<221> MXSC^FEATURE 

<222> (264) . . (264) 

<223> X-any amino acid 
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<400> 177 



Thr Gly Pro Val Ala Met Gly Arg Val He Arg Gly Gin Arg Lys Gly 
15 10 15 



Ala Gly Ser Val Phe Arg Ala His Val Lys His Arg Lys Gly Ala Ala 
20 25 30 



Arg Leu Arg Ala Val Asp Phe Ala Glu Arg His Gly Tyr He Lys Gly 
35 40 45 



He Val Lys Asp He He His Asp Pro Gly Arg Gly Ala Pro Leu Ala 
50 55 60 



Lys Val Val Phe Arg Asp Pro Tyr Arg Phe Lys Lys Arg Thr Glu Leu 
65 70 75 80 



Phe He Ala Ala Glu Gly He His Thr Gly Gin Phe Val Tyr Cys Gly 
85 90 95 



Lys Lys Ala Gin Leu Asn He Gly Asn Val Leu Pro Val Gly Thr Met 
100 105 110 



Pro Glu Gly Thr He Val Cys Cys Leu Glu Glu Lys Pro Gly Asp Arg 
115 120 125 



Gly Lys Leu Ala Arg Ala Ser Gly Asn Tyr Ala Thr Val He Ser His 
130 135 140 



Asn Pro Glu Thr Lys Lys Thr Arg Val Lys Leu Pro Ser Gly Ser Lys 
145 150 155 160 



Lys Val He Ser Ser Ala Asn Arg Ala Val Val Gly Val Val Ala Gly 
165 170 175 



Gly Gly Arg He Asp Lys Pro He Leu Lys Ala Gly Arg Ala Tyr His 
180 185 190 



Lys Tyr Lys Ala Lys Arg Asn Cys Trp Pro Arg Val Arg Gly Val Ala 
195 200 205 



Met Asn Pro Val Glu His Pro Phe Gly Gly Gly Asn His Gin His He 
210 215 220 



Gly Lys Pro Ser Thr He Arg Arg Asp Ala Pro Ala Gly Arg Lys Val 
225 230 235 240 
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Gly Leu lie Ala Ala Arg Arg Thr Gly Arg Leu Arg Gly Thr Lys Thr 
245 250 255 



Val Gin Glu Asn Met Trp Ala Xaa Ser Gly Phe Ala Glu Lys Asn Thr 
260 265 270 



Thr Thr Gin Thr Gin Arg Gin Thr Tyr Arg Lys Lys Gly Gly Tyr Arg 
275 280 285 



Gly Arg His Ser Arg Gly Asn lie lie Ala Ala Glu Asp Arg Gly Gly 
290 295 300 



<210> 178 

<211> 185 

<212> PRT 

<213> Homo sapien 

<400> 178 

Met Pro Glu Gly Thr lie Val Cys Cys Leu Glu Glu Lys Pro Gly Asp 
1.5 10 15 



Arg Gly Lys Leu Ala Arg Ala Ser Gly Asn Tyr Ala Thr Val lie Ser 
20 25 30 



His Asn Pro Glu Thr Lys Lys Thr Arg Val Lys Leu Pro Ser Gly Ser 
35 40 45 



Lys Lys Val lie Ser Ser Ala Asn Arg Ala Val Val Gly Val Val Ala 
50 55 60 



Gly Gly Gly Arg lie Asp Lys Pro lie Leu Lys Ala Gly Arg Ala Tyr 
65 70 75 80 



His Lys Tyr Lys Ala Lys Arg Asn Cys Trp Pro Arg Val Arg Gly Val 
85 90 95 



Ala Met Asn Pro Val Glu His Pro Phe Gly Gly Gly Asn His Gin His 
100 105 110 



lie Gly Lys Pro Ser Thr lie Arg Arg Asp Ala Pro Ala Gly Arg Lys 
115 120 125 



Val Gly Leu lie Ala Ala Arg Arg Thr Gly Arg Leu Arg Gly Thr Lys 
130 135 140 
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Thr Val Gin Glu Asn Met Trp Ala His Lys Trp Val Cys Arg Glu Lys 
US 150 155 160 



Thr Gin Gin Arg Lys His Lys Gly Lys His He Glu Lys Lys Gly Ala 
165 170 175 



Thr Gly Ala Asp Thr Leu Glu Val He 
180 185 



<210> 179 

<211> 484 

<212> PRT 

<213> Homo sapien 

<400> 179 

His Gly Lys Arg Gly Arg His Gly Lys Arg Gly Arg His Gly Met Val 
1 5 10 . 15 



Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala 
20 25 30 



Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala 
35 40 45 



Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val 
50 55 60 



Ser Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser Ala Ala Tyr Val 
65 70 75 80 



Val He Glu Thr Asp Ala Glu Asp Gly He Lys Gly Cys Gly He Thr 
85 90 95 



Phe Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys Ala Val Asn Ala 
100 105 110 



Leu Ala His His Val Leu Asn Lys Asp Leu Lys Asp He Val Gly Asp 
115 120 125 



Phe Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly Gin Leu Arg Trp 
130 135 140 



He Gly Pro Glu Lys Gly Val Val His Leu Ala Thr Ala Ala Val Leu 
145 150 155 160 



Asn Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly Lys Pro Val Trp 
165 170 175 
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Lys Leu Leu Val Asp Met Asp Pro Arg Met Leu Val Ser Cys lie Asp 
180 185 190 



Phe Arg Tyr He Thr Asp Val Leu Thr Glu Glu Asp Ala Leu Glu He 
195 200 205 



Leu Gin Lys Gly Gin He Gly Lys Lys Glu Arg Glu Lys Gin Met Leu 
210 215 220 



Ala Gin Gly Tyr Pro Ala Tyr Thr Thr Ser Cys Ala Trp Leu Gly Tyr 
225 230 235 240 



Ser Asp Asp Thr Leu Lys Gin Leu Cys Ala Gin Ala Leu Lys Asp Gly 
245 250 255 



Trp Thr Arg Phe Lys Val Lys Val Gly Ala Asp Leu Gin Asp Asp Met 
260 265 270 



Arg Arg Cys Gin He He Arg Asp Met He Gly Pro Glu Lys Thr Leu 
275 280 285 



Met Met Asp Ala Asn Gin Arg Trp Asp Val Pro Glu Ala Val Glu Trp 
290 295 300 



Met Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp He Glu Glu Pro Thr 
305 310 315 320 



Ser Pro Asp Asp He Leu Gly His Ala Thr He Ser Lys Ala Leu Val 
325 330 335 



Pro Leu Gly He Gly He Ala Thr Gly Glu Gin Cys His Asn Arg Val 
340 345 350 



He Phe Lys Gin Leu Leu Gin Ala Lys Ala Leu Gin Phe Leu Gin He 
355 360 365 



Asp Ser Cys Arg Leu Gly Ser Val Asn Glu Asn Leu Ser Val Leu Leu 
370 375 380 



Met Ala Lys Lys Phe Glu He Pro Val Cys Pro His Ala Gly Gly Val 
385 390 395 400 



Gly Leu Cys gIu Leu Val Gin His Leu He He Phe Asp Tyr He Ser 
405 410 415 
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Val Ser Ala Ser Leu Glu Asn Arg Val Cys Glu Tyr Val Asp His Leu 
420 425 430 



His Glu His Phe Lys Tyr Pro Val Met lie Gin Arg Ala Ser Tyr Met 
435 440 445 



Pro Pro Lys Asp Pro Gly Tyr Ser Thr Glu Met Lys Glu Glu Ser Val 
450 455 460 



Lys Lys His Gin Tyr Pro Asp Gly Glu Val Trp Lys Lys Leu Leu Pro 
465 470 475 480 



Ala Gin Glu Asn 



<210> 180 

<211> 483 

<212> PRT 

<213> Homo sapien 

<400> 180 

Met Val ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Trp Ser 
15 10 15 



Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met 
20 25 30 



Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp 
35 40 45 



Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser 
50 55 60 



Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser Ala Ala Tyr Val Val 
65 70 75 80 



He Glu Thr Asp Ala Glu Asp Gly He Lys Gly Cys Gly He Thr Phe 
85 90 95 



Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys Ala Val Asn Ala Leu 
100 105 110 



Ala His His Val Leu Asn Lys Asp Leu Lys Asp He Val Gly Asp Phe 
115 120 125 



Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly Gin Leu Arg Trp He 
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236 

135 140 



Gly Pro Glu Lys Gly Val Val His Leu Ala Thr Ala Ala Val Leu Asn 
145 150 155 160 



Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly Lys Pro Val Trp Lys 
165 170 175 



Leu Leu Val Asp Met Asp Pro Arg Met Leu Val Ser Cys He Asp Phe 
180 185 190 



Arg Tyr He Thr Asp Val Leu Thr Glu Glu Asp Ala Leu Glu He Leu 
195 200 205 



Gin Lys Gly Gin He Gly Lys Lys Glu Arg Glu Lys Gin Met Leu Ala 
210 215 220 



Gin Gly Tyr Pro Ala Tyr Thr Thr Ser Cys Ala Trp Leu Gly Tyr Ser 
225 230 235 240 



Asp Asp Thr Leu Lys Gin Leu Cys Ala Gin Ala Leu Lys Asp Gly Trp 
245 250 255 



Thr Arg Phe Lys Val Lys Val Gly Ala Asp Leu Gin Asp Asp Met Arg 
260 265 270 



Arg Cys Gin He He Arg Asp Met He Gly Pro Glu Lys Thr Leu Met 
275 280 285 



Met Asp Ala Asn Gin Arg Trp Asp Val Pro Glu Ala Val Glu Trp Met 
290 295 300 



Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp He Glu Glu Pro Thr Ser 
305 310 315 320 



Pro Asp Asp He Leu Gly His Ala Thr He Ser Lys Ala Leu Val Pro 
325 330 335 



LeAi Gly He Gly He Ala Thr Gly Glu Gin Cys His Asn Arg Val He 
340 345 350 



Phe Lys Gin Leu Leu Gin Ala Lys Ala Leu Gin Phe Leu Gin He Asp 
355 360 365 



Ser Cys Arg Leu Gly Ser Val Asn Glu Asn Leu Ser Val Leu Leu Met 
370 375 380 
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Ala Lys Lys Phe Glu He Pro Val Cys Pro His Ala Gly Gly Val Gly 
385 390 395 400 



Leu Cys Glu Leu Val Gin His Leu He He Phe Asp Tyr He Ser Val 
405 410 415 



Ser Ala Ser Leu Glu Asn Arg Val Cys Glu Tyr Val Asp His Leu His 
420 425 430 



Glu His Phe Lys Tyr Pro Val Met He Gin Arg Ala Ser Tyr Met Pro 
435 440 445 



Pro Lys Asp Pro Gly Tyr Ser Thr Glu Met Lys Glu Glu Ser Val Lys 
450 455 460 



Lys His Gin Tyr Pro Asp Gly Glu Val Trp Lys Lys Leu Leu Pro Ala 
465 470 475 480 



Gin Glu Asn 



<210> 181 

<211> 484 

<212> PRT 

<213> Homo sapien 

<400> 181 

His Gly Lys Arg Gly Arg His Gly Lys Arg Gly Arg His Gly Met Val 
15 10 15 



Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala 
20 25 30 



Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala 
35 40 45 



Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val 
50 55 60 



Ser Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser Ala Ala Tyr Val 
65 70 75 80 



Val He Glu Thr Asp Ala Glu Asp Gly He Lys Gly Cys Gly He Thr 
85 90 95 
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Phe Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys Ala Val Asn Ala 
100 105 110 



Leu Ala His His Val Leu Asn Lys Asp Leu Lys Asp He Val Gly Asp 
115 120 125 



Phe Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly Gin Leu Arg Trp 
130 135 140 



He Gly Pro Glu Lys Gly Val Val His Leu Ala Thr Ala Ala Val Leu 
145 150 155 160 



Asn Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly Lys Pro Val Trp 
165 170 175 



Lys Leu Leu Val Asp Met Asp Pro Arg Met Leu Val Ser Cys He Asp 
180 185 190 



Phe Arg Tyr He Thr Asp Val Leu Thr Glu Glu Asp Ala Leu Glu He 
195 200 205 



Leu Gin Lys Gly Gin He Gly Lys Lys Glu Arg Glu Lys Gin Met Leu 
210 215 220 



Ala Gin Gly Tyr Pro Ala Tyr Thr Thr Ser Cys Ala Trp Leu Gly Tyr 
225 230 235 240 



Ser Asp Asp Thr Leu Lys Gin Leu Cys Ala Gin Ala Leu Lys Asp Gly 
245 250 255 



Trp Thr Arg Phe Lys Val Lys Val Gly Ala Asp Leu Gin Asp Asp Met 
260 265 270 



Arg Arg Cys Gin He He Arg Asp Met He Gly Pro Glu Lys Thr Leu 
275 280 285 



Met Met Asp Ala Asn Gin Arg Trp Asp Val Pro Glu Ala Val Glu Trp 
290 295 300 



Met Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp He Glu Glu Pro Thr 
305 310 315 320 



Ser Pro Asp Asp He Leu Gly His Ala Thr He Ser Lys Ala Leu Val 
325 330 335 



Pro Leu Gly He Gly He Ala Thr Gly Glu Gin Cys His Asn Arg Val 
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340 345 350 



He Phe Lys Gin Leu Leu Gin Ala Lys Ala Leu Gin Phe Leu Gin He 
355 360 365 



Asp Ser Cys Arg Leu Gly Ser Val Asn Glu Asn Leu Ser Val Leu Leu 
370 375 380 



Met Ala Lys Lys Phe Glu He Pro Val Cys Pro His Ala Gly Gly Val 
385 390 395 400 



Gly Leu Cys Glu Leu Val Gin His Leu He He Phe Asp Tyr He Ser 
405 410 415 



Val Ser Ala Ser Leu Glu Asn Arg Val Cys Glu Tyr Val Asp His Leu 
420 425 430 



His Glu His Phe Lys Tyr Pro Val Met He Gin Arg Ala Ser Tyr Met 
435 440 445 



Pro Pro Lys Asp Pro Gly Tyr Ser Thr Glu Met Lys Glu Glu Ser Val 
450 455 460 



Lys Lys His Gin Tyr Pro Asp Gly Glu Val Trp Lys Lys Leu Leu Pro 
465 470 475 480 



Ala Gin Glu Asn 



<210> 182 

<211> 484 

<212> PRT 

<213> Homo sapien 

<400> 182 

His Gly Lys Arg Gly Arg His Gly Lys Arg Gly Arg His Gly Met Val 
15 10 15 



Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala 
20 25 30 



Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala 
35 40 45 



Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val 
50 55 60 
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Ser Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser Ala Ala Tyr Val 
65 70 75 80 



Val He Glu Thr Asp Ala Glu Asp Gly He Lys Gly Cys Gly He Thr 
85 90 95 



Phe Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys Ala Val Asn Ala 
100 105 110 



Leu Ala His His Val Leu Asn Lys Asp Leu Lys Asp He Val Gly Asp 
115 120 125 



Phe Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly Gin Leu Arg Trp 
130 135 140 



He Gly Pro Glu Lys Gly Val Val His Leu Ala Thr Ala Ala Val Leu 
145 150 155 160 



Asn Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly Lys Pro Val Trp 
165 170 175 



Lys Leu Leu Val Asp Met Asp Pro Arg Met Leu Val Ser Cys He Asp 
180 185 190 



Phe Arg Tyr He Thr Asp Val Leu Thr Glu Glu Asp Ala Leu Glu He 
195 200 205 



Leu Gin Lys Gly Gin He Gly Lys Lys Glu Arg Glu Lys Gin Met Leu 
210 215 220 



Ala Gin Gly Tyr Pro Ala Tyr Thr Thr Ser Cys Ala Trp Leu Gly Tyr 
225 230 235 240 



Ser Asp Asp Thr Leu Lys Gin Leu Cys Ala Gin Ala Leu Lys Asp Gly 
245 250 255 



Trp Thr Arg Phe Lys Val Lys Val Gly Ala Asp Leu Gin Asp Asp Met 
260 265 270 



Arg Arg Cys Gin He He Arg Asp Met He Gly Pro Glu Lys Thr Leu 
275 280 285 



Met Met Asp Ala Asn Gin Arg Trp Asp Val Pro Glu Ala Val Glu Trp 
290 295 300 
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Met Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp lie Glu Glu Pro Thr 
305 310 315 320 



Ser Pro Asp Asp lie Leu Gly His Ala Thr lie Ser Lys Ala Leu Val 
325 330 335 



Pro Leu Gly He Gly He Ala Thr Gly Glu Gin Cys His Asn Arg Val 
340 345 350 



He Phe Lys Gin Leu Leu Gin Ala Lys Ala Leu Gin Phe Leu Gin He 
355 360 365 



Asp Ser Cys Arg Leu Gly Ser Val Asn Glu Asn Leu Ser Val Leu Leu 
370 375 380 



Met Ala Lys Lys Phe Glu He Pro Val Cys Pro His Ala Gly Gly Val 
385 390 395 400 



Gly Leu Cys Glu Leu Val Gin His Leu He He Phe Asp Tyr He Ser 
405 410 415 



Val Ser Ala Ser Leu Glu Asn Arg Val Cys Glu Tyr Val Asp His Leu 
420 425 430 



His Glu His Phe Lys Tyr Pro Val Met He Gin Arg Ala Ser Tyr Met 
435 440 445 



Pro Pro Lys Asp Pro Gly Tyr Ser Thr Glu Met Lys Glu Glu Ser Val 
450 455 460 



Lys Lys His Gin Tyr Pro Asp Gly Glu Val Trp Lys Lys Leu Leu Pro 
465 470 475 480 



Ala Gin Glu Asn 



<210> 183 

<211> 249 

<212> PRT 

<213> Homo sapien 

<400> 183 

Arg Met Ala Gly Pro Gly Glu Cys Asp Asp Gly Pro Asp Phe Pro Ser 
15 10 15 



Trp Arg Gin Glu Arg Leu Arg Gin Phe Lys Val Lys Val Gly Ala Asp 
20 25 30 
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Leu Gin Asp Asp Met Arg Arg Cys Gin lie lie Arg Asp Met He Gly 
35 40 45 



Pro Glu Lys Thr Leu Met Met Asp Ala Asn Gin Arg Trp Asp Val Pro 
50 55 60 



Glu Ala Val Glu Trp Met Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp 
65 70 75 80 



He Glu Glu Pro Thr Ser Pro Asp Asp He Leu Gly His Ala Thr He 
85 90 95 



Ser Lys Ala Leu Val Pro Leu Gly He Gly He Ala Thr Gly Glu Gin 
100 105 110 



Cys His Asn Arg Val He Phe Lys Gin Leu Leu Gin Ala Lys Ala Leu 
115 120 125 



Gin Phe Leu Gin He Asp Ser Cys Arg Leu Gly Ser Val Asn Glu Asn 
130 135 140 



Leu Ser Val Leu Leu Met Ala Lys Lys Phe Glu He Pro Val Cys Pro 
145 150 155 160 



His Ala Gly Gly Val Gly Leu Cys Glu Leu Val Gin His Leu He He 
165 170 175 



Phe Asp Tyr He Ser Val Ser Ala Ser Leu Glu Asn Arg Val Cys Glu 
180 185 190 



Tyr Val Asp His Leu His Glu His Phe Lys Tyr Pro Val Met He Gin 
195 200 205 



Arg Ala Ser Tyr Met Pro Pro Lys Asp Pro Gly Tyr Ser Thr Glu Met 
210 215 220 



Lys Glu Glu Ser Val Lys Lys His Gin Tyr Pro Asp Gly Glu Val Trp 
225 230 235 240 



Lys Lys Leu Leu Pro Ala Gin Glu Asn 
245 



<210> 184 
<211> 221 
<212> PRT 
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<213> Homo sapien 
<400> 184 

Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Trp Ser 
15 10 15 



Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met 
20 25 30 



Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp 
35 40 45 



Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser 
50 55 60 



Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser Ala Ala Tyr Val Val 
65 70 75 80 



lie Glu Thr Asp Ala Glu Asp Gly lie Lys Gly Cys Gly lie Thr Phe 
85 90 95 



Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys Ala Val Asn Ala Leu 
100 105 110 



Ala His His Val Leu Asn Lys Asp Leu Lys Asp lie Val Gly Asp Phe 
115 120 125 



Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly Gin Leu Arg Trp lie 
130 135 140 



Gly Pro Glu Lys Gly Val Val His Leu Ala Thr Ala Ala Val Leu Asn 
145 150 155 160 



Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly Lys Pro Val Trp Lys 
165 170 175 



Leu Leu Val Asp Met Asp Pro Arg Met Leu Val Ser Cys lie Asp Phe 
180 185 190 



Arg Tyr lie Thr Asp Val Leu Thr Glu Glu Asp Ala Leu Glu lie Leu 
195 200 205 



Gin Lys Gly Gin He Gly Lys Lys Glu Arg Gly Gly Leu 
210 215 220 



<210> 185 
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<211> 416 
<212> PRT 
<213> Homo sapien 

<400> 185 

His Gly Lys Arg Gly Arg His Gly Lys Arg Gly Arg His Gly Met Val 
15 10 15 



Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala 
20 25 30 



Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala 
35 40 45 



Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val 
50 55 60 



Ser Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser Ala Ala Tyr Val 
65 70 75 80 



Val He Glu Thr Asp Ala Glu Asp Gly He Lys Gly Cys Gly He Thr 
85 90 95 



Phe Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys Ala Val Asn Ala 
100 105 110 



Leu Ala His His Val Leu Asn Lys Asp Leu Lys Asp He Val Gly Asp 
115 120 125 



Phe Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly Gin Leu Arg Trp 
130 135 140 



He Gly Pro Glu Lys Gly Val Val His Leu Ala Thr Ala Ala Val Leu 
145 150 155 160 



Asn Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly Lys Pro Val Trp 
165 170 175 



Lys Leu Leu Val Asp Met Asp Pro Arg Met Leu Val Ser Cys He Asp 
180 185 190 



Phe Arg Tyr He Thr Asp Val Leu Thr Glu Glu Asp Ala Leu Glu He 
195 200 205 



Leu Gin Lys Gly Gin He Gly Lys Lys Glu Arg Glu Lys Gin Met Leu 
210 215 220 
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Ala Gin Gly Tyr Pro Ala Tyr Thr Thr Ser Cys Ala Trp Leu Gly Tyr 
225 230 235 240 



Ser Asp Asp Thr Leu Lys Gin Leu Cys Ala Gin Ala Leu Lys Asp Gly 
245 250 255 



Trp Thr Arg Phe Lys Val Lys Val Gly Ala Asp Leu Gin Asp Asp Met 
260 265 270 



Arg Arg Cys Gin He He Arg Asp Met He Gly Pro Glu Lys Thr Leu 
275 280 285 



Met Met Asp Ala Asn Gin Arg Trp Asp Val Pro Glu Ala Val Glu Trp 
290 295 300 



Met Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp He Glu Glu Pro Thr 
305 310 315 320 



Ser Pro Asp Asp He Leu Gly His Ala Thr He Ser Lys Ala Leu Val 
325 330 335 



Pro Leu Gly lie Gly He Ala Thr Gly Glu Gin Val Ser Asp Ala Pro 
340 345 350 



Asn Arg Trp Met Thr Ser Pro Trp Gly Gin Tyr Thr Leu Thr Ser Asp 
355 360 365 



Arg Gly His Ser Cys Val Leu Gly Ser He Thr Cys Cys Thr Leu Ser 
370 375 380 



Trp Glu He Phe He He Leu Glu Thr Gly Ser Phe Tyr Gin Ser Leu 
385 390 395 400 



Glu Ser Asp He Glu Lys Val Cys Gly Tyr Phe Ser Asn Leu Tyr Asp 
405 410 415 



<210> 186 

<211> 415 

<212> PRT 

<213> Homo sapien 

<400> 186 

Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Trp Ser 
IS 10 15 



Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met 
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20 



25 



30 



Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp 
35 40 45 



Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser 
50 55 60 



Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser Ala Ala Tyr Val Val 
65 70 75 80 



He Glu Thr Asp Ala Glu Asp Gly He Lys Gly Cys Gly He Thr Phe 
85 90 95 



Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys Ala Val Asn Ala Leu 
100 105 110 



Ala His His Val Leu Asn Lys Asp Leu Lys Asp He Val Gly Asp Phe 
115 120 125 



Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly Gin Leu Arg Trp He 
130 135 140 



Gly Pro Glu Lys Gly Val Val His Leu Ala Thr Ala Ala Val Leu Asn 
145 150 155 160 



Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly Lys Pro Val Trp Lys 
165 170 175 



Leu Leu Val Asp Met Asp Pro Arg Met Leu Val Ser Cys He Asp Phe 
180 185 190 



Arg Tyr He Thr Asp Val Leu Thr Glu Glu Asp Ala Leu Glu He Leu 
195 200 205 



Gin Lys Gly Gin He Gly Lys Lys Glu Arg Glu Lys Gin Met Leu Ala 
210 215 220 



Gin Gly Tyr Pro Ala Tyr Thr Thr Ser Cys Ala Trp Leu Gly Tyr Ser 
225 230 235 240 



Asp Asp Thr Leu Lys Gin Leu Cys Ala Gin Ala Leu Lys Asp Gly Trp 
245 250 255 



Thr Arg Phe Lys Val Lys Val Gly Ala Asp Leu Gin Asp Asp Met Arg 
260 265 270 



wo 2004/050860 



PCT/US2003/040063 



247 



Arg Cys Gin He He Arg Asp Met He Gly Pro Glu Lys Thr Leu Met 
275 280 285 



Met Asp Ala Asn Gin Arg Trp Asp Val Pro Glu Ala Val Glu Trp Met 
290 295 300 



Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp He Glu Glu Pro Thr Ser 
305 310 315 320 



Pro Asp Asp He Leu Gly His Ala Thr He Ser Lys Ala Leu Val Pro 
325 330 335 



Leu Gly He Gly He Ala Thr Gly Glu Gin Val Ser Asp Ala Pro Asn 
340 345 350 



Arg Trp Met Thr Ser Pro Trp Gly Gin Tyr Thr Leu Thr Ser Asp Arg 
355 360 365 



Gly His Ser Cys Val Leu Gly Ser He Thr Cys Cys Thr Leu Ser Trp 
370 375 380 



Glu He Phe He He Leu Glu Thr Gly Ser Phe Tyr Gin Ser Leu Glu 
385 390 395 400 



Ser Asp He Glu Lys Val Cys Gly Tyr Phe Ser Asn Leu Tyr Asp 
405 410 415 



<210> 187 

<211> 484 

<212> PRT 

<213> Homo sapien 

<400> 187 

His Gly Lys Arg Gly Arg His Gly Lys Arg Gly Arg His Gly Met Val 
^5 10 15 



Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala 
20 25 30 



Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala 
35 40 45 



Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val 
50 55 60 
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Ser Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser Ala Ala Tyr Val 
65 70 75 80 



Val He Glu Thr Asp Ala Glu Asp Gly He Lys Gly Cys Gly He Thr 
85 90 95 



Phe Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys Ala Val Asn Ala 
100 105 110 



Leu Ala His His Val Leu Asn Lys Asp Leu Lys Asp He Val Gly Asp 
115 120 125 



Phe Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly Gin Leu Arg Trp 
130 135 140 



He Gly Pro Glu Lys Gly Val Val His Leu Ala Thr Ala Ala Val Leu 
145 150 155 160 



Asn Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly Lys Pro Val Trp 
165 170 175 



Lys Leu Leu Val Asp Met Asp Pro Arg Met Leu Val Ser Cys He Asp 
180 185 190 



Phe Arg Tyr He Thr Asp Val Leu Thr Glu Glu Asp Ala Leu Glu He 
195 200 205 



Leu Gin Lys Gly Gin He Gly Lys Lys Glu Arg Glu Lys Gin Met Leu 
210 215 220 



Ala Gin Gly Tyr Pro Ala Tyr Thr Thr Ser Cys Ala Trp Leu Gly Tyr 
225 230 235 240 



Ser Asp Asp Thr Leu Lys Gin Leu Cys Ala Gin Ala Leu Lys Asp Gly 
245 250 255 



Trp Thr Arg Phe Lys Val Lys Val Gly Ala Asp Leu Gin Asp Asp Met 
260 265 270 



Arg Arg Cys Gin He He Arg Asp Met He Gly Pro Glu Lys Thr Leu 
275 280 285 



Met Met Asp Ala Asn Gin Arg Trp Asp Val Pro Glu Ala Val Glu Trp 
290 295 300 



Met Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp He Glu Glu Pro Thr 
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305 310 315 320 



Ser Pro Asp Asp He Leu Gly His Ala Thr He Ser Lys Ala Leu Val 
325 330 335 



Pro Leu Gly He Gly He Ala Thr Gly Glu Gin Cys His Asn Arg Val 
340 345 350 



He Phe Lys Gin Leu Leu Gin Ala Lys Ala Leu Gin Phe Leu Gin He 
355 360 365 



Asp Ser Cys Arg Leu Gly Ser Val Asn Glu Asn Leu Ser Val Leu Leu 
370 375 380 



Met Ala Lys Lys Phe Glu He Pro Val Cys Pro His Ala Gly Gly Val 
385 390 395 400 



Gly Leu Cys Glu Leu Val Gin His Leu He He Phe Asp Tyr He Ser 
405 410 415 



Val Ser Ala Ser Leu Glu Asn Arg Val Cys Glu Tyr Val Asp His Leu 
420 425 430 



His Glu His Phe Lys Tyr Pro Val Met He Gin Arg Ala Ser Tyr Met 
435 440 445 



Pro Pro Lys Asp Pro Gly Tyr Ser Thr Glu Met Lys Glu Glu Ser Val 
450 455 460 



Lys Lys His Gin Tyr Pro Asp Gly Glu Val Trp Lys Lys Leu Leu Pro 
465 470 475 480 



Ala Gin Glu Asn 



<210> 188 

<211> 349 

<212> PRT 

<213> Homo sapien 

<400> 188 

His Gly Lys Arg Gly Arg His Gly Lys Arg Gly Arg His Gly Met Val 
^5 10 15 



Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala 
20 25 30 
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Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala 
35 40 45 



Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val 
50 55 60 



Ser Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser Ala Ala Tyr Val 
65 70 75 80 



Val He Glu Thr Asp Ala Glu Asp Gly He Lys Gly Cys Gly He Thr 
85 90 95 



Phe Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys Ala Val Asn Ala 
100 105 110 



Leu Ala His His Val Leu Asn Lys Asp Leu Lys Asp He Val Gly Asp 
115 120 125. 



Phe Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly Gin Leu Arg Trp 
130 135 140 



He Gly Pro Glu Lys Gly Val Val His Leu Ala Thr Ala Ala Val Leu 
145 150 155 160 



Asn Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly Lys Pro Val Trp 
165 170 175 



Lys Leu Leu Val Asp Met Asp Pro Arg Met Leu Val Ser Cys He Asp 
180 185 190 



Phe Arg Tyr He Thr Asp Val Leu Thr Glu Glu Asp Ala Leu Glu He 
195 200 205 



Leu Gin Lys Gly Gin He Gly Lys Lys Glu Arg Glu Lys Gin Met Leu 
210 215 220 



Ala Gin Gly Tyr Pro Ala Tyr Thr Thr Ser Cys Ala Trp Leu Gly Tyr 
225 230 235 240 



Ser Asp Asp Thr Leu Lys Gin Leu Cys Ala Gin Ala Leu Lys Asp Gly 
245 250 255 



Trp Thr Arg Phe Lys Val Lys Val Gly Ala Asp Leu Gin Asp Asp Met 
260 265 270 
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Arg Arg Cys Gin lie He Arg Asp Met He Gly Pro Glu Lys Thr Leu 
275 280 285 



Met Met Asp Ala Asn Gin Arg Trp Asp Val Pro Glu Ala Val Glu Trp 
290 295 300 



Met Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp He Glu Glu Pro Thr 
305 310 315 320 



Ser Pro Asp Asp He Leu Gly His Ala Thr He Ser Lys Ala Leu Val 
325 330 335 



Pro Leu Gly He Gly He Ala Thr Gly Glu Gin Gly Val 
340 345 



<210> 189 

<211> 305 

<212> PRT 

<213> Homo sapien 

<400> 189 

Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala 
15 10 15 



Asp Ala Met Val Ser Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser 
20 25 30 



Ala Ala Tyr Val Val He Glu Thr Asp Ala Glu Asp Gly He Lys Gly 
35 40 45 



Cys Gly He Thr Phe Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys 
50 55 60 



Ala Val Asn Ala Leu Ala His His Val Leu Asn Lys Asp Leu Lys Asp 
65 70 75 80 



He Val Gly Asp Phe Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly 
85 90 95 



Gin Leu Arg Trp He Gly Pro Glu Lys Gly Val Val His Leu Ala Thr 
100 105 110 



Ala Ala Val Leu Asn Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly 
115 120 125 



Lys Pro Val Trp Lys Leu Leu Val Asp Met Asp Pro Arg Met Leu Val 
130 135 140 
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Ser Cys lie Asp Phe Arg Tyr He Thr Asp Val Leu Thr Glu Glu Asp 
145 150 155 160 



Ala Leu Glu lie Leu Gin Lys Gly Gin He Gly Lys Lys Glu Arg Glu 
165 170 175 



Lys Gin Met Leu Ala Gin Gly Tyr Pro Ala Tyr Thr Thr Ser Cys Ala 
180 185 190 



Trp Leu Gly Tyr Ser Asp Asp Thr Leu Lys Gin Leu Cys Ala Gin Ala 
195 200 205 



Leu Lys Asp Gly Trp Thr Arg Phe Lys Val Lys Val Gly Ala Asp Leu 
210 215 220 



Gin Asp Asp Met Arg Arg Cys Gin He He Arg Asp Met He Gly Pro 
225 230 235 240 



Glu Lys Thr Leu Met Met Asp Ala Asn Gin Arg Trp Asp Val Pro Glu 
245 250 255 



Ala Val Glu Trp Met Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp He 
260 265 270 



Glu Glu Pro Thr Ser Pro Asp Asp He Leu Gly His Ala Thr He Ser 
275 280 285 



Lys Ala Leu Val Pro Leu Gly He Gly He Ala Thr Gly Glu Gin Gly 
290 295 300 



Val 
305 



<210> 190 

<211> 484 

<212> PRT 

<2I3> Homo sapien 

<400> 190 

His Gly Lys Arg Gly Arg His Gly Lys Arg Gly Arg His Gly Met Val 
15 10 15 



Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala 
20 25 30 
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Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala 
35 40 45 



Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val 
50 55 60 



Ser Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser Ala Ala Tyr Val 
65 70 75 80 



Val He Glu Thr Asp Ala Glu Asp Gly He Lys Gly Cys Gly He Thr 
85 90 95 



Phe Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys Ala Val Asn Ala 
100 105 110 



Leu Ala His His Val Leu Asn Lys Asp Leu Lys Asp He Val Gly Asp 
115 120 125 



Phe Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly Gin Leu Arg Trp 
130 135 140 



He Gly Pro Glu Lys Gly Val Val His Leu Ala Thr Ala Ala Val Leu 
145 150 155 160 



Asn Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly Lys Pro Val Trp 
165 170 175 



Lys Leu Leu Val Asp Met Asp Pro Arg Met Leu Val Ser Cys He Asp 
180 185 190 



Phe Arg Tyr He Thr Asp Val Leu Thr Glu Glu Asp Ala Leu Glu He 
195 200 205 



Leu Gin Lys Gly Gin He Gly Lys Lys Glu Arg Glu Lys Gin Met Leu 
210 215 220 



Ala Gin Gly Tyr Pro Ala Tyr Thr Thr Ser Cys Ala Trp Leu Gly Tyr 
225 230 235 240 



Ser Asp Asp Thr Leu Lys Gin Leu Cys Ala Gin Ala Leu Lys Asp Gly 
245 250 255 



Trp Thr Arg Phe Lys Val Lys Val Gly Ala Asp Leu Gin Asp Asp Met 
260 265 270 



Arg Arg Cys Gin He He Arg Asp Met He Gly Pro Glu Lys Thr Leu 
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275 280 285 



Met Met Asp Ala Asn Gin Arg Trp Asp Val Pro Glu Ala Val Glu Trp 
290 295 300 



Met Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp lie Glu Glu Pro Thr 
305 310 315 320 



Ser Pro Asp Asp He Leu Gly His Ala Thr He Ser Lys Ala Leu Val 
325 330 335 



Pro Leu Gly He Gly He Ala Thr Gly Glu Gin Cys His Asn Arg Val 
340 345 350 



He Phe Lys Gin Leu Leu Gin Ala Lys Ala Leu Gin Phe Leu Gin He 
355 360 365 



Asp Ser Cys Arg Leu Gly Ser Val Asn Glu Asn Leu Ser Val Leu Leu 
370 375 380 



Met Ala Lys Lys Phe Glu He Pro Val Cys Pro His Ala Gly Gly Val 
385 390 395 400 



Gly Leu Cys Glu Leu Val Gin His Leu He He Phe Asp Tyr He Ser 
405 410 415 



Val Ser Ala Ser Leu Glu Asn Arg Val Cys Glu Tyr Val Asp His Leu 
420 425 430 



His Glu His Phe Lys Tyr Pro Val Met He Gin Arg Ala Ser Tyr Met 
435 440 445 



Pro Pro Lys Asp Pro Gly Tyr Ser Thr Glu Met Lys Glu Glu Ser Val 
450 455 460 



Lys Lys His Gin Tyr Pro Asp Gly Glu Val Trp Lys Lys Leu Leu Pro 
465 470 475 480 



Ala Gin Glu Asn 



<210> 191 

<211> 484 

<212> PRT 

<213> Homo sapien 



<400> 191 
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His Gly Lys Arg Gly Arg His Gly Lys Arg Gly Arg His Gly Met Val 
15 10 15 



Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala 
20 25 30 



Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala 
35 40 45 



Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val 
50 55 60 



Ser Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser Ala Ala Tyr Val 
65 70 75 80 



Val He Glu Thr Asp Ala Glu Asp Gly He Lys Gly Cys Gly He Thr 
85 90 95 



Phe Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys Ala Val Asn Ala 
100 105 110 



Leu Ala His His Val Leu Asn Lys Asp Leu Lys Asp lie Val Gly Asp 
115 120 125 



Phe Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly Gin Leu Arg Trp 
130 135 140 



He Gly Pro Glu Lys Gly Val Val His Leu Ala Thr Ala Ala Val Leu 
145 150 155 160 



Asn Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly Lys Pro Val Trp 
165 170 175 



Lys Leu Leu Val Asp Met Asp Pro Arg Met Leu Val Ser Cys He Asp 
180 185 190 



Phe Arg Tyr He Thr Asp Val Leu Thr Glu Glu Asp Ala Leu Glu He 
195 200 205 



Leu Gin Lys Gly Gin He Gly Lys Lys Glu Arg Glu Lys Gin Met Leu 
210 215 220 



Ala Gin Gly Tyr Pro Ala Tyr Thr Thr Ser Cys Ala Trp Leu Gly Tyr 
225 230 235 240 
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Ser Asp Asp Thr Leu Lys Gin Leu Cys Ala Gin Ala Leu Lys Asp Gly 
245 250 255 



Trp Thr Arg Phe Lys Val Lys Val Gly Ala Asp Leu Gin Asp Asp Met 
260 265 270 



Arg Arg Cys Gin He He Arg Asp Met He Gly Pro Glu Lys Thr Leu 
275 280 285 



Met Met Asp Ala Asn Gin Arg Trp Asp Val Pro Glu Ala Val Glu Trp 
290 295 300 



Met Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp He Glu Glu Pro Thr 
305 310 315 320 



Ser Pro Asp Asp He Leu Gly His Ala Thr He Ser Lys Ala Leu Val 
325 330 335 



Pro Leu Gly He Gly He Ala Thr Gly Glu Gin Cys His Asn Arg Val 
340 345 350 



He Phe Lys Gin Leu Leu Gin Ala Lys Ala Leu Gin Phe Leu Gin He 
355 360 365 



Asp Ser Cys Arg Leu Gly Ser Val Asn Glu Asn Leu Ser Val Leu Leu 
370 375 380 



Met Ala Lys Lys Phe Glu He Pro Val Cys Pro His Ala Gly Gly Val 
385 390 395 400 



Gly Leu Cys Glu Leu Val Gin His Leu He He Phe Asp Tyr He Ser 
405 410 415 



Val Ser Ala Ser Leu Glu Asn Arg Val Cys Glu Tyr Val Asp His Leu 
420 425 430 



His Glu His Phe Lys Tyr Pro Val Met He Gin Arg Ala Ser Tyr Met 
435 440 445 



Pro Pro Lys Asp Pro Gly Tyr Ser Thr Glu Met Lys Glu Glu Ser Val 
450 455 460 



Lys Lys His Gin Tyr Pro Asp Gly Glu Val Trp Lys Lys Leu Leu Pro 
465 470 475 480 



Ala Gin Glu Asn 
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<210> 192 

<211> 484 

<212> PRT 

<213> Homo sapien 

<400> 192 

His Gly Lys Arg Gly Arg His Gly Lys Arg Gly Arg His Gly Met Val 
15 10 15 



Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala 
20 25 30 



Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala 
35 40 45 



Asp Ala Met Val Ser Ala Asp Ala Met Val Ser Ala Asp Ala Met Val 
50 55 60 



Ser Ala Asp Ala Met His Thr Asp Pro Asp Tyr Ser Ala Ala Tyr Val 
65 70 75 80 



Val He Glu Thr Asp Ala Glu Asp Gly He Lys Gly Cys Gly He Thr 
85 90 95 



Phe Thr Leu Gly Lys Gly Thr Glu Val Val Val Cys Ala Val Asn Ala 
100 105 110 



Leu Ala His His Val Leu Asn Lys Asp Leu Lys Asp He Val Gly Asp 
115 120 125 



Phe Arg Gly Phe Tyr Arg Gin Leu Thr Ser Asp Gly Gin Leu Arg Trp 
130 135 140 



He Gly Pro Glu Lys Gly Val Val His Leu Ala Thr Ala Ala Val Leu 
145 150 155 160 



Asn Ala Val Trp Asp Leu Trp Ala Lys Gin Glu Gly Lys Pro Val Trp 
165 170 175 



Lys Leu Leu Val Asp Met Asp Pro Arg Met Leu Val Ser Cys He Asp 
180 185 190 



Phe Arg Tyr He Thr Asp Val Leu Thr Glu Glu Asp Ala Leu Glu He 
195 200 205 
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Leu Gin Lys Gly Gin lie Gly Lys Lys Glu Arg Glu Lys Gin Met Leu 
210 215 220 



Ala Gin Gly Tyr Pro Ala Tyr Thr Thr Ser Cys Ala Trp Leu Gly Tyr 
225 230 235 240 



Ser Asp Asp Thr Leu Lys Gin Leu Cys Ala Gin Ala Leu Lys Asp Gly 
245 250 255 



Trp Thr Arg Phe Lys Val Lys Val Gly Ala Asp Leu Gin Asp Asp Met 
260 265 270 



Arg Arg Cys Gin lie lie Arg Asp Met lie Gly Pro Glu Lys Thr Leu 
275 280 285 



Met Met Asp Ala Asn Gin Arg Trp Asp Val Pro Glu Ala Val Glu Trp 
290 295 300 



Met Ser Lys Leu Ala Lys Phe Lys Pro Leu Trp lie Glu Glu Pro Thr 
305 310 315 320 



Ser Pro Asp Asp lie Leu Gly His Ala Thr He Ser Lys Ala Leu Val 
325 330 335 



Pro Leu Gly He Gly He Ala Thr Gly Glu Gin Cys His Asn Arg Val 
340 345 350 



He Phe Lys Gin Leu Leu Gin Ala Lys Ala Leu Gin Phe Leu Gin He 
355 360 365 



Asp Ser Cys Arg Leu Gly Ser Val Asn Glu Asn Leu Ser Val Leu Leu 
370 375 380 



Met Ala Lys Lys Phe Glu He Pro Val Cys Pro His Ala Gly Gly Val 
385 390 395 400 



Gly Leu Cys Glu Leu Val Gin His Leu He He Phe Asp Tyr He Ser 
405 410 415 



Val Ser Ala Ser Leu Glu Asn Arg Val Cys Glu Tyr Val Asp His Leu 
420 425 430 



His Glu His Phe Lys Tyr Pro Val Met He Gin Arg Ala Ser Tyr Met 
435 440 445 
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Pro Pro Lys Asp Pro Gly Tyr Ser Thr Glu Met Lys Glu Glu Ser Val 
450 455 460 



Lys Lys His Gin Tyr Pro Asp Gly Glu Val Trp Lys Lys Leu Leu Pro 
465 470 475 480 



Ala Gin Glu Asn 



<210> 193 

<211> 138 

<212> PRT 

<213> Homo sapien 

<400> 193 

Trp lie Val Val Ala Ala Arg Tyr Arg lie Arg Leu Gly Leu Tyr Leu 
15 10 15 



Thr Leu Ala Ser Glu Val Tyr Tyr Thr Arg Leu Gly Asn Asp Phe His 
20 25 30 



Thr Asn Lys Arg Val Cys Glu Glu He Ala He He Pro Ser Lys Lys 
35 40 45 



Leu Arg Asn Lys He Ala Gly Tyr Val Thr His Leu Met Lys Arg He 
50 55 60 



Gin Arg Gly Pro Val Arg Gly He Ser He Lys Leu Gin Glu Glu Glu 
65 70 75 80 



Arg Glu Arg Arg Asp Asn Tyr Val Pro Glu Val Ser Ala Leu Asp Gin 
85 90 95 



Glu He He Glu Val Asp Pro Asp Thr Lys Glu Met Leu Lys Leu Leu 
100 105 110 



Asp Phe Gly Ser Leu Ser Asn Leu Gin Val Thr Gin Pro Thr Val Gly 
115 120 125 



Met Asn Phe Lys Thr Pro Arg Gly Pro Val 
130 135 



<210> 194 

<211> 386 

<212> PRT 

<213> Homo sapien 



<400> 194 
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Met Pro Trp Ala Met lie Trp Asp Phe Thr Glu Pro Val Cys Arg Gly 
15 10 15 



Cys Val Asn Tyr Glu Gly Ala Asp Arg Val Glu Phe Val He Glu Thr 
20 25 30 



Ala Arg Gin Leu Lys Arg Ala His Gly Cys Phe Pro Glu Gly Arg Ser 
35 40 45 



Pro Pro Gly Ala Ala Ala Ser Ala Ala Ala Lys Pro Pro Pro Leu Ser 
50 55 60 



Ala Lys Asp He Leu Leu Gin Gin Gin Gin Gin Leu Gly His Gly Gly 
65 70 75 80 



Pro Glu Ala Ala Pro Arg Ala Pro Gin Ala Leu Glu Arg Tyr Pro Leu 
85 90 95 



Ala Ala Ala Ala Glu Arg Pro Pro Arg Leu Gly Ser Asp Phe Gly Ser 
100 105 110 



Ser Arg Pro Ala Ala Ser Leu Ala Gin Pro Pro Thr Pro Gin Pro Pro 
115 120 125 



Pro Val Asn Gly He Leu Val Pro Asn Gly Phe Ser Lys Leu Glu Glu 
130 135 140 



Pro Pro Glu Leu Asn Arg Gin Ser Pro Asn Pro Arg Arg Gly His Ala 
145 150 155 160 



Val Pro Pro Thr Leu Val Pro Leu Met Asn Gly Ser Ala Thr Pro Leu 
165 170 175 



Pro Thr Ala Leu Gly Leu Gly Gly Arg Ala Ala Ala Ser Leu Ala Ala 
180 185 190 



Val Ser Gly Thr Ala Ala Ala Ser Leu Gly Ser Ala Gin Pro Thr Asp 
195 200 205 



Leu Gly Ala His Lys Arg Pro Ala Ser Val Ser Ser Ser Ala Ala Val 
210 215 220 



Glu His Glu Gin Arg Glu Ala Ala Ala Lys Glu Lys Gin Pro Pro Pro 
225 230 235 240 
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Pro Ala His Arg Gly Pro Ala Asp Ser Leu Ser Thr Ala Ala Gly Ala 
245 250 255 



Ala Glu Leu Ser Ala Glu Gly Ala Gly Lys Ser Arg Gly Ser Gly Glu 
260 265 270 



Gin Asp Trp val Asn Arg Pro Lys Thr Val Arg Asp Thr Leu Leu Ala 
275 280 285 



Leu His Gin His Gly His Ser Gly Pro Phe Glu Ser Lys Phe Lys Lys 
290 295 300 



Glu Pro Ala Leu Thr Ala Gly Arg Leu Leu Gly Phe Glu Ala Asn Gly 
305 310 315 320 



Ala Asn Gly Ser Lys Ala Gly Arg Gly Gly Cys Glu Val Arg Gly Ser 
325 330 335 



Arg Gly Glu Lys Gly Thr Glu Ser Arg Gly Arg Val Val Leu Trp He 
340 345 350 



His His Phe Thr Pro Ala Gin Lys Gin Gin Thr Pro His Phe Leu He 
355 360 365 



Cys Leu Arg Arg Asn Gin Cys Leu Val Ala Thr Cys Ser Cys Ala Glu 
370 375 380 



Ala Ala 



385 




<210> 


195 


<211> 


492 


<212> 


PRT 


<213> 


Homo sap i en 


<400> 


195 


Met Pro Trp Ala Met 


1 


5 



10 15 



Cys Val Asn Tyr Glu Gly Ala Asp Arg Val Glu Phe Val He Glu Thr 
20 25 30 



Ala Arg Gin Leu Lys Arg Ala His Gly Cys Phe Pro Glu Gly Arg Ser 
35 40 45 



Pro Pro Gly Ala Ala Ala Ser Ala Ala Ala Lys Pro Pro Pro Leu Ser 
50 55 60 
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Ala Lys Asp lie Leu Leu Gin Gin Gin Gin Gin Leu Gly His Gly Gly 
65 70 75 80 



Pro Glu Ala Ala Pro Arg Ala Pro Gin Ala Leu Glu Arg Tyr Pro Leu 
65 90 95 



Ala Ala Ala Ala Glu Arg Pro Pro Arg Leu Gly Ser Asp Phe Gly Ser 
100 105 110 



Ser Arg Pro Ala Ala Ser Leu Ala Gin Pro Pro Thr Pro Gin Pro Pro 
115 120 125 



Pro Val Asn Gly lie Leu Val Pro Asn Gly Phe Ser Lys Leu Glu Glu 
130 135 140 



Pro Pro Glu Leu Asn Arg Gin Ser Pro Asn Pro Arg Arg Gly His Ala 
145 150 155 160 



Val Pro Pro Thr Leu Val Pro Leu Met Asn Gly Ser Ala Thr Pro Leu 
165 170 175 



Pro Thr Ala Leu Gly Leu Gly Gly Arg Ala Ala Ala Ser Leu Ala Ala 
180 185 190 



Val Ser Gly Thr Ala Ala Ala Ser Leu Gly Ser Ala Gin Pro Thr Asp 
195 200 205 



Leu Gly Ala His Lys Arg Pro Ala Ser Val Ser Ser Ser Ala Ala Val 
210 215 220 



Glu His Glu Gin Arg Glu Ala Ala Ala Lys Glu Lys Gin Pro Pro Pro 
225 230 235 240 



Pro Ala His Arg Gly Pro Ala Asp Ser Leu Ser Thr Ala Ala Gly Ala 
245 250 255 



Ala Glu Leu Ser Ala Glu Gly Ala Gly Lys Ser Arg Gly Ser Gly Glu 
260 265 270 



Gin Asp Trp Val Asn Arg Pro Lys Thr Val Arg Asp Thr Leu Leu Ala 
275 280 285 



Leu His Gin His Gly His Ser Gly Pro Phe Glu Ser Lys Phe Lys Lys 
290 295 300 
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Glu Pro Ala Leu Thr Ala Val Ala Arg Thr Ala Arg Lys Arg Lys Pro 
305 310 315 320 



Ser Pro Glu Pro Glu Gly Glu Val Gly Pro Pro Lys lie Asn Gly Glu 
325 330 335 



Ala Gin Pro Trp Leu Ser Thr Ser Thr Glu Gly Leu Lys lie Pro Met 
340 345 350 



Thr Pro Thr Ser Ser Phe Val Ser Pro Pro Pro Pro Thr Ala Ser Pro 
355 360 365 



Hifi Ser Asn Arg Thr Thr Pro Pro Glu Ala Ala Gin Asn Gly Gin Ser 
370 375 380 



Pro Met Ala Ala Leu lie Leu Val Ala Asp Asn Ala Gly Gly Ser His 
385 390 395 400 



Ala Ser Lys Asp Ala Asn Gin Val His Pro Leu Trp Gin Pro Val Pro 
405 410 415 



Arg Cys Ala Ala Pro Ser Ala Thr Ser Gly Trp Arg Thr Pro lie Leu 
420 425 430 



Cys Ser Ala Arg Pro Ser Leu Arg Thr Ser Ser Ala Ser Leu Ala Pro 
435 440 445 



Asp Lys Ala Ser Asn Ser Arg Glu Leu Val Glu Arg Ser He Val Pro 
450 455 460 



Val Gly Lys Asn Ala Leu Leu Trp Ala Pro Met Ser Pro Gly Pro Leu 
465 470 475 480 



Cys Lys Gly Lys Leu Gin Pro Ser Leu Leu Glu Met 
485 490 



<210> 196 

<211> 358 

<212> PRT 

<213> Homo sapien 

<400> 196 

Met Ser Gly Val Arg Pro Pro He Met Asn Gly Pro Leu His Pro Arg 
15 10 15 



Pro Leu Val Ala Leu Leu Asp Gly Arg Asp Cys Thr Val Glu Met Pro 



wo 2004/050860 



PCTAJS2003/040063 



264 



20 



25 



30 



lie Leu Lys Asp Val Ala Thr Val Ala Phe Cys Asp Ala Gin Ser Thr 
35 40 45 



Gin Glu lie His Glu Lys Val Leu Asn Glu Ala Val Gly Ala Leu Met 
50 55 60 



Tyr His Thr He Thr Leu Thr Arg Glu Asp Leu Glu Lys Phe Lys Ala 
65 70 75 80 



Leu Arg He He Val Arg lie Gly Ser Gly Phe Asp Asn He Asp He 
85 90 95 



Lys Ser Ala Gly Asp Leu Gly He Ala Val Cys Asn Val Pro Ala Ala 
100 105 110 



Ser Val Glu Glu Thr Ala Asp Ser Thr Leu Cys His He Leu Asn Leu 
115 120 125 



Tyr Arg Arg Ala Thr Trp Leu His Gin Ala Leu Arg Glu Gly Thr Arg 
130 135 140 



Val Gin Ser Val Glu Gin He Arg Glu Val Ala Ser Gly Ala Ala Arg 
145 150 155 160 



He Arg Gly Glu Thr Leu Gly He He Gly Leu Gly Arg Val Gly Gin 
165 170 175 



Ala Val Ala Leu Arg Ala Lys Ala Phe Gly Phe Asn Val Leu Phe Tyr 
180 185 190 



Asp Pro Tyr Leu Ser Asp Gly Val Glu Arg Ala Leu Gly Leu Gin Arg 
195 200 205 



Val Ser Thr Leu Gin Asp Leu Leu Phe His Ser Asp Cys Val Thr Leu 
210 215 220 



His Cys Gly Leu Asn Glu His Asn His His Leu He Asn Asp Phe Thr 
225 230 235 240 



Val Lys Gin Met Arg Gin Gly Ala Phe Leu Val Asn Thr Ala Arg Gly 
245 250 255 



Gly Leu Val Asp Glu Lys Ala Leu Ala Gin Ala Leu Lys Glu Gly Arg 
260 265 270 
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He Arg Gly Ala Ala Leu Asp Val His Glu Ser Glu Pro Phe Ser Phe 
275 280 285 



Ser Gin Gly Pro Leu Lys Asp Ala Pro Asn Leu He Cys Thr Pro His 
290 295 300 



Ala Ala Trp Tyr Ser Glu Gin Ala Ser He Glu Met Arg Glu Glu Ala 
305 310 315 320 



Ala Arg Glu He Arg Arg Ala He Thr Gly Arg He Pro Asp Ser Leu 
325 330 335 



Lys Asn Phe Cys Pro Val Ser Phe Ala Phe Leu Val Lys Gin Lys Lys 
340 345 350 



Ser Val Val He Leu Pro 
355 



<210> 197 

<211> 364 

<212> PRT 

<213> Homo sapien 

<400> 197 

Met Gly Pro Gly His Gly Val Met Ala Ser Arg Pro Asp Leu Gin Pro 
15 10 15 



Leu Gin His Leu Gly Thr Pro Gly Ser Pro Gly Leu Asp Val Gin Pro 
20 25 30 



Gin Glu Glu Thr Pro Pro Gin Gly Gin Tyr Gin Pro Ala Ala Pro Gly 
35 40 45 



Ala Thr Asp Pro Leu Ala Gly Arg Gly Gin Ala Ala Cys Pro Pro He 
50 55 60 



Arg Ala Pro Pro Thr Arg Asp Leu Glu He Lys Ser Leu Gly Leu Pro 
65 70 75 80 



His Pro Pro Leu Ser Gly Ala Pro Gly Val Ser Asp Gly Pro Gly Ala 
85 90 95 



Val Leu Leu Ser Ser Ala Ser Leu Pro Ser Arg Ala Gly Pro Trp Gly 
100 105 110 
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Leu Trp Phe Pro Gly Arg Ala Pro His Arg Gly Phe Gin Cys Gin Pro 
115 120 125 



Pro Pro Leu Arg Thr Gin Pro Gin His Ser Gly Cys Thr Asp His Ala 
130 135 140 



Cys Ala Val Pro Ser Phe Ser Gin Gly Pro Leu Lys Asp Ala Pro Asn 
145 150 155 160 



Leu lie Cys Thr Pro His Ala Ala Trp Tyr Ser Glu Gin Ala Ser He 
165 170 175 



Glu Met Arg Glu Glu Ala Ala Arg Glu He Arg Arg Ala He Thr Gly 
180 185 190 



Arg He Pro Asp Ser Leu Lys Asn Cys Val Asn Lys Asp His Leu Thr 
195 200 205 



Ala Ala Thr His Trp Ala Ser Met Asp Pro Ala Val Val His Pro Glu 
210 215 220 



Leu Asn Gly Ala Ala Tyr Ser Arg Gly Thr Leu Arg Ala Trp Trp Ala 
225 230 235 240 



Trp Pro Pro Leu Ala Ser Gin Leu Leu Trp Lys Val Ser Ser Pro Ala 
245 250 255 



Pro Cys Pro Cys Pro Thr Ala Cys Pro Leu Trp Pro Thr Arg Pro Thr 
260 265 270 



Pro Leu Leu Leu Ala Lys Pro Ser Ser Pro Arg Arg He Glu Thr Thr 
275 280 285 



Pro Val Thr Ser Cys Ser Pro Gly Gly Ala Leu Gin Pro Arg Arg Leu 
290 295 300 



Gly Arg Gly Pro Gly Asn Pro Arg Thr Arg Val Cys Gly Gly Gly He 
305 310 315 320 



Cys Val Val Ala Leu Ala Leu Gin Arg Leu Val Arg Ala Val Arg Arg 
325 330 335 



Arg Glu Gly Ala Ala Leu Gly Leu Val Ser Leu Val Val Val Arg Pro 
340 345 350 



Val Gly Ala Leu Pro Cys Val Leu Arg Val Pro Arg 
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355 


<210> 


198 


<211> 


192 


<212> 


PRT 


<213> 


Homo sap i en 


<400> 


198 



267 

360 



Ala Gin Pro Ala Cys Arg Ala Glu Arg Gly Arg Gly Val Cys Gly Ser 
15 10 15 



Gin Ala Gly Pro Pro Thr Gly Gly Ser Ser Ala Gin Pro Pro Pro Leu 
20 25 30 



Arg Thr Gin Pro Gin His Ser Gly Cys Thr Asp His Ala Cys Ala Val 
35 40 45 



Pro Ser Phe Ser Gin Gly Pro Leu Lys Asp Ala Pro Asn Leu lie Cys 
50 55 60 



Thr Pro His Ala Ala Trp Tyr Ser Glu Gin Ala Ser He Glu Met Arg 
65 70 75 80 



Glu Glu Ala Ala Arg Glu He Arg Arg Ala He Thr Gly Arg He Pro 
85 90 95 



Asp Ser Leu Lys Asn Cys Val Asn Lys Asp His Leu Thr Ala Ala Thr 
100 105 110 



His Trp Ala Ser Met Asp Pro Ala Val Val His Pro Glu Leu Asn Gly 
115 120 125 



Ala Ala Tyr Arg Tyr Pro Pro Gly Val Val Gly Val Ala Pro Thr Gly 
130 135 140 



He Pro Ala Ala Val Glu Gly He Val Pro Ser Ala Met Ser Leu Ser 
145 150 155 160 



His Gly Leu Pro Pro Val Ala His Pro Pro His Ala Pro Ser Pro Gly 
165 170 . 175 



Gin Thr Val Lys Pro Glu Ala Asp Arg Asp His Ala Ser Asp Gin Leu 
180 IBS 190 



<210> 199 
<211> 178 
<212> PRT 
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<213> Homo sapien 
<400> 199 

Met Arg Glu Glu Ala Pro Phe Ser Phe Ser Gin Gly Pro Leu Lys Asp 
15 10 15 



Ala Pro Asn Leu lie Cys Thr Pro His Ala Ala Trp Tyr Met Asp Pro 
20 25 30 



Ala Val Val His Pro Glu Leu Asn Gly Ala Ala Tyr Ser Arg Gly Thr 
35 40 45 



Leu Arg Ala Trp Trp Ala Trp Pro Pro Leu Ala Ser Gin Leu Leu Trp 
50 55 60 



Lys Val Ser Ser Pro Ala Pro Cys Pro Cys Pro Thr Ala Cys Pro Leu 
65 70 75 80 



Trp Pro Thr Arg Pro Thr Pro Leu Leu Leu Ala Lys Pro Ser Ser Pro 
85 90 95 



Arg Arg lie Glu Thr Thr Pro Val Thr Ser Cys Ser Pro Gly Gly Ala 
100 105 110 



Leu Gin Pro Arg Arg Leu Gly Arg Gly Pro Gly Asn Pro Arg Thr Arg 
115 120 125 



Val Cys Gly Gly Gly He Cys Val Val Ala Leu Ala Leu Gin Arg Leu 
130 135 140 



Val Arg Ala Val Arg Arg Arg Glu Gly Ala Ala Leu Gly Leu Val Ser 
145 150 155 160 



Leu Val Val Val Arg Pro Val Gly Ala Leu Pro Cys Val Leu Arg Val 
165 170 175 



Pro Arg 



<210> 200 

<211> 162 

<212> PRT 

<213> Homo sapien 

<400> 200 

Arg Met His Pro Thr Ser Ser Ala Pro Pro Met Leu His Gly Thr Trp 
15 10 15 
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Thr Pro Pro Ser Cys Thr Leu Ser Ser Met Gly Leu Pro He Gly Thr 
20 25 30 



Leu Arg Ala Trp Trp Ala Trp Pro Pro Leu Ala Ser Gin Leu Leu Trp 
35 40 45 



Lys Val Ser Ser Pro Ala Pro Cys Pro Cys Pro Thr Ala Cys Pro Leu 
50 55 60 



Trp Pro Thr Arg Pro Thr Pro Leu Leu Leu Ala Lys Pro Ser Ser Pro 
65 70 75 80 



Arg Arg He Glu Thr Thr Pro Val Thr Ser Cys Ser Pro Gly Gly Ala 
85 90 95 



Leu Gin Pro Arg Arg Leu Gly Arg Gly Pro Gly Asn Pro Arg Thr Arg 
100 105 110 



Val Cys Gly Gly Gly He Cys Val Val Ala Leu Ala Leu Gin Arg Leu 
115 120 125 



Val Arg Ala Val Arg Arg Arg Glu Gly Ala Ala Leu Gly Leu Val Ser 
130 135 140 



Leu Val Val Val Arg Pro Val Gly Ala Leu Pro Cys Val Leu Arg Val 
145 150 155 160 



Pro Arg 



<210> 201 

<211> 272 

<212> PRT 

<213> Homo sapien 

<400> 201 

Ala Ser Cys Gly Val Gly Arg Leu Val Gly Trp Gly He Ser Gly Gly 
15 10 15 



Gly Ala Ser Leu Gly Pro Gly His Leu Gly Gly Gly Ala Ser Trp Gly 
20 25 30 



Arg Gly He Ser Glu Gly Ala Ser Gly Gly Trp Ser He Leu Gly Gly 
35 40 45 
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Gly Ser Arg Trp Gin Arg Gly Phe Pro Gin Leu Ala Gly Gly Val lie 
50 55 60 



Leu Gly Val Ala Leu Trp Leu Arg His Asp Pro Gin Thr Thr Asn Leu 
65 70 75 80 



Leu Tyr Leu Glu Leu Gly Asp Lys Pro Ala Pro Asn Thr Phe Tyr Val 
85 90 95 



Gly lie Tyr He Leu He Ala Val Gly Ala Val Met Met Phe Val Gly 
100 105 110 



Phe Leu Gly Cys Tyr Gly Ala He Gin Glu Ser Gin Cys Leu Leu Gly 
115 120 125 



Thr Phe Phe Thr Cys Leu Val He Leu Phe Ala Cys Glu Val Ala Ala 
130 135 140 



Gly He Trp Gly Phe Val Asn Lys Asp Gin He Ala Lys Asp Val Lys 
145 150 155 160 



Gin Phe Tyr Asp Gin Ala Leu Gin Gin Ala Val Val Asp Asp Asp Ala 
165 170 175 



Asn Asn Ala Lys Ala Val Val Lys Thr Phe His Glu Thr Leu Asp Cys 
180 185 190 



Cys Gly Ser Ser Thr Leu Thr Ala Leu Thr Thr Ser Val Leu Lys Asn 
195 200 205 



Asn Leu Cys Pro Ser Gly Ser Asn He He Ser Asn Leu Phe Lys Glu 
210 215 220 



Asp Cys His Gin Lys He Asp Asp Leu Phe Ser Gly Lys Leu Tyr Leu 
225 230 235 240 



He Gly He Ala Ala He Val Val Ala Val He Met He Phe Glu Met 
245 250 255 



He Leu Ser Met Val Leu Cys Cys Gly He Arg Asn Ser Ser Val Tyr 
260 265 270 



<210> 202 

<211> 303 

<212> PRT 

<213> Homo sapien 
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<400> 202 

Met Ser Gly Ala Val Thr Ser His Leu Pro Gin Ala Gly Leu Phe Cys 
15 10 15 



Thr Ala Cys Leu Gly Arg Trp Trp Glu Ser Leu Trp Pro Ser Ala Leu 
20 25 30 



Pro Trp Gin Trp Gly Gin Leu Gly His Leu Gly Gly Ala Arg Leu Pro 
35 40 45 



Gin Ala Arg Pro Trp Asp Leu Ser Arg Cys Leu Val Val Ala Cys Phe 
50 55 60 



Ser Pro Gly Met Trp Glu Arg His Gin Thr Gin Asp Val Pro Leu Pro 
65 70 75 80 



Ala Pro Glu Ala Pro Ser Pro Asp Glu Leu Ala Gly Gly Val lie Leu 
85 90 95 



Gly Val Ala Leu Trp Leu Arg His Asp Pro Gin Thr Thr Asn Leu Leu 
100 105 110 



Tyr Leu Glu Leu Gly Asp Lys Pro Ala Pro Asn Thr Phe Tyr Val Gly 
115 120 125 



lie Tyr lie Leu lie Ala Val Gly Ala Val Met Met Phe Val Gly Phe 
130 135 140 



Leu Gly Cys Tyr Gly Ala lie Gin Glu Ser Gin Cys Leu Leu Gly Thr 
145 150 155 160 



Phe Phe Thr Cys Leu Val lie Leu Phe Ala Cys Glu Val Ala Ala Gly 
165 170 175 



lie Trp Gly Phe Val Asn Lys Asp Gin lie Ala Lys Asp Val Lys Gin 
180 185 190 



Phe Tyr Asp Gin Ala Leu Gin Gin Ala Val Val Asp Asp Asp Ala Asn 
195 200 205 



Asn Ala Lys Ala Val Val Lys Thr Phe His Glu Thr Leu Asp Cys Cys 
210 215 220 



Gly Ser Ser Thr Leu Thr Ala Leu Thr Thr Ser Val Leu Lys Asn Asn 
225 230 235 240 
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Leu Cys Pro Ser Gly Ser Asn He He Ser Asn Leu Phe Lys Glu Asp 
245 250 255 



Cys His Gin Lys He Asp Asp Leu Phe Ser Gly Lys Leu Tyr Leu He 
260 265 270 



Gly He Ala Ala He Val Val Ala Val He Met He Phe Glu Met He 
275 280 285 



Leu Ser Met Val Leu Cys Cys Gly He Arg Asn Ser Ser Val Tyr 
290 295 300 



<210> 203 

<211> 420 

<212> PRT 

<213> Homo sapien 

<400> 203 

Met Leu Pro Ser Gin Gly Ala Trp Gly Ser Ser Gly Gly Leu Ala Tyr 
15 10 15 



Thr Pro Trp Ser Ser Cys Pro Arg Trp Gly Ala Gly Leu Gin Pro Ser 
20 25 30 



Ala Gin Gly Leu Gly He Gin Leu Asp Pro Pro His Thr Ala Ala Arg 
35 40 45 



Phe Lys Cys Arg Ser Arg Asn Gly Ser Ala Ala Val Gin Pro Arg Leu 
50 55 60 



Gly Gly Arg Ser Gin Gin Gly Pro Pro Thr Leu Phe Ser His His Thr 
65 70 75 80 



Gly Glu Ala Ala Leu Val Pro Val Pro Val Pro Gly Leu Pro Ser Gin 
85 90 95 



Pro Arg Pro Thr Val Gly Pro Thr Leu Cys Leu Leu Met Pro Leu Pro 
100 105 110 



Pro His Ala Lys Ser Gin Arg Leu Trp Glu Arg Val Lys Ala Val Gly 
115 120 125 



Gly Gly Trp Gin Val Gin Ala Val Gly Gly Gly Cys Gly Arg Trp Arg 
130 135 140 



Ala Pro Pro Gin Val Ser Ser Cys Glu Ala Pro Val Ala Ser Thr Gin 
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145 



150 



155 



160 



Ser Ala His Glu Val Pro Ser Pro His Val Ala Ser Leu Val Ser Val 
165 170 175 



Cys Val Met Glu Glu Val Thr Glu Ala Gin Lys Thr His Gin Ala Arg 
180 185 190 



Leu Gly Cys Glu Val Pro Cys Cys Ser Ser Leu Ala Val Ser Asn Pro 
195 200 205 



Thr Ser Ser Gin Leu Gly Gly Pro Trp Trp Val Arg His Pro Gly Pro 
210 215 220 



Ser Gly Val Leu Gly Cys Gly Glu Cys Val Gly Thr His Leu Val Ser 
225 230 235 240 



Leu Ser Pro Gin Gly He Tyr He Leu He Ala Val Gly Ala Val Met 
245 250 255 



Met Phe Val Gly Phe Leu Gly Cys Tyr Gly Ala He Gin Glu Ser Gin 
260 265 270 



Cys Leu Leu Gly Thr Phe Phe Thr Cys Leu Val He Leu Phe Ala Cys 
275 280 285 



Glu Val Ala Ala Gly He Trp Gly Phe Val Asn Lys Asp Gin He Ala 
290 295 300 



Lys Asp Val Lys Gin Phe Tyr Asp Gin Ala Leu Gin Gin Ala Val Val 
305 310 315 320 



Asp Asp Asp Ala Asn Asn Ala Lys Ala Val Val Lys Thr Phe His Glu 
325 330 335 



Thr Leu Asp Cys Cys Gly Ser Ser Thr Leu Thr Ala Leu Thr Thr Ser 
340 345 350 



Val Leu Lys Asn Asn Leu Cys Pro Ser Gly Ser Asn He He Ser Asn 
355 360 365 



Leu Phe Lys Glu Asp Cys His Gin Lys He Asp Asp Leu Phe Ser Gly 
370 375 380 



Lys Leu Tyr Leu He Gly He Ala Ala He Val Val Ala Val He Met 
385 390 395 400 
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lie Phe Glu Met He Leu Ser Met Val Leu Cys Cys Gly He Arg Asn 
405 410 415 



Ser Ser Val Tyr 
420 



<210> 204 

<211> 247 

<212> PRT 

<213> Homo sapien 

<400> 204 

Ser Pro Ser Cys Val Met Glu Glu Val Thr Glu Ala Gin Lys Thr His 
15 10 15 



Gin Ala Arg Leu Gly Cys Glu Val Pro Cys Cys Ser Ser Leu Ala Val 
20 25 30 



Ser Asn Pro Thr Ser Ser Gin Leu Gly Gly Pro Trp Trp Val Arg His 
35 40 45 



Pro Gly Pro Ser Gly Val Leu Gly Cys Gly Glu Cys Val Gly Thr His 
50 55 60 



Leu Val Ser Leu Ser Pro Gin Gly He Tyr He Leu He Ala Val Gly 
65 70 75 80 



Ala Val Met Met Phe Val Gly Phe Leu Gly Cys Tyr Gly Ala He Gin 
85 90 95 



Glu Ser Gin Cys Leu Leu Gly Thr Phe Phe Thr Cys Leu Val He Leu 
100 105 110 



Phe Ala Cys Glu Val Ala Ala Gly He Trp Gly Phe Val Asn Lys Asp 
115 120 125 



Gin He Ala Lys Asp Val Lys Gin Phe Tyr Asp Gin Ala Leu Gin Gin 
130 135 140 



Ala Val Val Asp Asp Asp Ala Asn Asn Ala Lys Ala Val Val Lys Thr 
145 150 155 160 



Phe His Glu Thr Leu Asp Cys Cys Gly Ser Ser Thr Leu Thr Ala Leu 
165 170 175 
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Thr Thr Ser Val Leu Lys Asn Asn Leu Cys Pro Ser Gly Ser Asn lie 
180 185 190 



lie Ser Asn Leu Phe Lys Glu Asp Cys His Gin Lys lie Asp Asp Leu 
195 200 205 



Phe Ser Gly Lys Leu Tyr Leu lie Gly lie Ala Ala lie Val Val Ala 
210 215 220 



Val He Met He Phe Glu Met He Leu Ser Met Val Leu Cys Cys Gly 
225 230 235 240 



He Arg Asn Ser Ser Val Tyr 
245 



<210> 205 

<211> 236 

<212> PRT 

<213> Homo sap i en 

<400> 205 

Met Gly Val Glu Gly Cys Thr Lys Cys He Lys Tyr Leu Leu Phe Val 
15 10 15 



Phe Asn Phe Val Phe Trp Leu Ala Gly Gly Val He Leu Gly Val Ala 
20 25 30 



Leu Trp Leu Arg His Asp Pro Gin Thr Thr Asn Leu Leu Tyr Leu Glu 
35 40 45 



Leu Gly Asp Lys Pro Ala Pro Asn Thr Phe Tyr Val Gly He Tyr He 
50 55 60 



Leu He Ala Val Gly Ala Val Met Met Phe Val Gly Phe Leu Gly Cys 
65 70 75 80 



Tyr Gly Ala He Gin Glu Ser Gin Cys Leu Leu Gly Thr Phe Phe Thr 
85 90 95 



Cys Leu Val He Leu Phe Ala Cys Glu Val Ala Ala Gly He Trp Gly 
100 105 110 



Phe val Asn Lys Asp Gin He Ala Lys Asp Val Lys Gin Phe Tyr Asp 
115 120 125 



Gin Ala Leu Gin Gin Ala Val Val Asp Asp Asp Ala Asn Asn Ala Lys 
130 135 140 
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Ala Val val Lys Thr Phe His Glu Thr Leu Asp Cys Cys Gly Ser Ser 
145 150 155 160 



Thr Leu Thr Ala Leu Thr Thr Ser Val Leu Lys Asn Asn Leu Cys Pro 
165 170 175 



Ser Gly Ser Asn lie He Ser Asn Leu Phe Lys Glu Asp Cys His Gin 
180 185 190 



Lys He Asp Asp Leu Phe Ser Gly Lys Leu Tyr Leu He Gly He Ala 
195 200 205 



Ala He Val Val Ala Val He Met He Phe Glu Met He Leu Ser Met 
210 215 220 



Val Leu Cys Cys Gly He Arg Asn Ser Ser Val Tyr 
225 230 235 



<210> 206 

<211> 256 

<212> PRT 

<213> Homo sapien 

<400> 206 

Met Gly Val Glu Gly Cys Thr Lys Cys He Lys Tyr Leu Leu Phe Val 
15 10 15 



Phe Asn Phe Val Phe Trp Leu Ala Gly Gly Val He Leu Gly Val Ala 
20 25 30 



Leu Trp Leu Arg His Asp Pro Gin Thr Thr Asn Leu Leu Tyr Leu Glu 
35 40 45 



Leu Gly Asp Lys Pro Ala Pro Asn Thr Phe Tyr Val Gly He Tyr He 
50 55 60 



Leu He Ala Val Gly Ala Val Met Met Phe Val Gly Phe Leu Gly Cys 
65 70 75 80 



Tyr Gly Ala He Gin Glu Ser Gin Cys Leu Leu Gly Thr Phe Phe Thr 
85 90 95 



Cys Leu Val He Leu Phe Ala Cys Glu Val Ala Ala Gly He Trp Gly 
100 105 110 
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Phe Val Asn Lys Asp Gin He Ala Lys Asp Val Lys Gin Phe Tyr Asp 
115 120 125 



Gin Ala Leu Gin Gin Ala Val Val Asp Asp Asp Ala Asn Asn Ala Lys 
130 135 140 



Ala Val Val Lys Thr Phe His Glu Thr Leu Asp Cys Cys Gly Ser Ser 
145 150 155 160 



Thr Leu Thr Ala Leu Thr Thr Ser Val Leu Lys Asn Asn Leu Cys Pro 
165 170 175 



Ser Gly Ser Asn He lie Ser Asn Leu Phe Lys Glu Asp Cys His Gin 
180 185 190 



Lys He Asp Asp Leu Phe Ser Gly Lys Leu Tyr Leu He Gly He Ala 
195 200 205 



Ala He Val Val Ala Val He Met He Phe Glu Met He Leu Ser Met 
210 215 220 



Val Leu Asn Asp Asn Leu Cys He He Gly Lys Val Arg He Ser Gly 
225 230 235 240 



Arg Gin Gly Phe Tyr Pro Asn Gin Gin His Lys Arg Gin Tyr Asn Cys 
245 250 255 



<210> 207 

<211> 210 

<212> PRT 

<213> Homo sapien 

<400> 207 

Met Gly Val Glu Gly Cys Thr Lys Cys He Lys Tyr Leu Leu Phe Val 
15 10 15 



Phe Asn Phe Val Phe Trp Leu Ala Gly Gly Val He Leu Gly Val Ala 
20 25 30 



Leu Trp Leu Arg His Asp Pro Gin Thr Thr Asn Leu Leu Tyr Leu Glu 
35 40 45 



Leu Gly Asp Lys Pro Ala Pro Asn Thr Phe Tyr Val Gly He Tyr He 
50 55 60 



Leu He Ala Val Gly Ala Val Met Met Phe Val Gly Phe Leu Gly Cys 
65 70 75 80 
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Tyr Gly Ala He Gin Glu Ser Gin Cys Leu Leu Gly Thr Phe Phe Thr 
85 90 95 



Cys Leu Val He Leu Phe Ala Cys Glu Val Ala Ala Gly He Trp Gly 
100 105 110 



Phe Val Asn Lys Asp Gin He Ala Lys Asp Val Lys Gin Phe Tyr Asp 
115 120 125 



Gin Ala Leu Gin Gin Ala Val Val Asp Asp Asp Ala Asn Asn Ala Lys 
130 135 140 



Ala Val Val Lys Thr Phe His Glu Thr Leu Asp Cys Cys Gly Ser Ser 
145 150 155 160 



Thr Leu Thr Ala Leu Thr Thr Ser Val Leu Lys Asn Asn Leu Cys Pro 
165 170 175 



Ser Gly Ser Asn He He Ser Asn Leu Phe Lys Glu Asp Cys His Gin 
ISO 185 190 



Lys He Asp Asp Leu Phe Ser Gly Lys Leu Tyr Leu Ala Ala Thr Thr 
195 200 205 



Leu Arg 
210 



<210> 208 

<211> 58 

<212> PRT 

<213> Homo sapien 

<400> 208 

Asn His He Glu Pro Leu Lys He Gin Trp Leu Asp Val Leu Gin Arg 
15 10 15 



Glu Pro Arg Pro Phe Pro Lys Leu Arg He Leu Arg Lys Val Glu Lys 
20 25 30 



He Asp Asp Phe Lys Ala Glu Asp Phe Gin He Glu Gly Tyr Asn Pro 
35 40 45 



His Pro Thr He Lys Met Glu Met Ala Val 
50 55 
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<210> 209 
<211> 91 
<212> PRT 
<213> Homo sapien 

<400> 209 

Lys Phe Ser Gly Ser Met Cys Phe Ser Glu Asn Pro Asp Leu Ser Gin 
15 10 15 



Ser Ser Gly Phe Phe Glu Lys Leu Arg Lys Leu Met Thr Ser Lys Leu 
20 25 30 



Lys Thr Phe Arg Leu Lys Gly Thr He Arg He Gin Leu Leu Lys Trp 
35 40 45 



Lys Trp Leu Phe Arg Val Leu Ser Lys Glu Leu Glu Gly Tyr Cys Gin 
50 55 60 



Ser Leu Gly Val Gly Leu Asp Ala Glu Val Lys Val Leu Phe Ala Leu 
65 70 75 80 



Lys Glu Lys Gly Thr Arg Ser Lys He Cys Pro 
85 90 



<210> 210 

<211> 86 

<212> PRT 

<213> Homo sapien 

<400> 210 

Met Asp Asp Ser Glu Val Glu Ser Thr Ala Ser He Leu Ala Ser Val 
15 10 15 



Lys Glu Gin Glu Ala Gin Phe Glu Lys Leu Thr Arg Ala Leu Glu Glu 
20 25 30 



Glu Arg Arg His Val Ser Ala Gin Leu Glu Arg Val Arg Val Ser Pro 
35 40 45 



Gin Asp Ala Asn Pro Leu Met Ala Asn Gly Thr Ser Pro Phe Arg Lys 
50 55 60 



Lys Cys Lys Lys Lys Ser He Phe Ser Ser Arg Val Glu Leu Phe Lys 
65 70 75 80 



Glu Ser Lys He He Ser 
85 
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<210> 211 

<211> 107 

<212> PRT 

<213> Homo sapien 

<400> 211 

Met lie lie Tyr Tyr Met Val His Asn His Val Asp Ala Gin Cys Met 
15 10 15 



He Leu Gin Asn Arg Leu Ser Val Ser Arg Arg Val Leu Arg Gly Met 
20 25 30 



Val Met Tyr Thr Ser Lys Asp Arg Tyr Phe Tyr Phe Gly Lys Leu Asp 
35 40 45 



Giy Gin He Ser Ser Ala Tyr Pro Ser Gin Glu Gly Gin Val Leu Val 
50 55 60 



Gly He Tyr Gly Gin Tyr Gin Leu Leu Gly He Lys Ser He Gly Phe 
65 70 75 80 



Glu Trp Asn Tyr Pro Leu Glu Glu Pro Thr Thr Glu Pro Pro Val Asn 
85 90 95 



Leu Thr Tyr Ser Ala Asn Ser Pro Val Gly Arg 
100 105 



<210> 212 

<211> 90 

<212> PRT 

<213> Homo sapien 

<400> 212 

Tyr Cys Arg He Gly Leu Arg Val Ala Arg Val Leu Arg Gly Met Val 
15 10 15 



Met Tyr Thr Ser Lys Asp Arg Tyr Phe Tyr Phe Gly Lys Leu Asp Gly 
20 25 30 



Gin He Ser Ser Ala Tyr Pro Ser Gin Glu Gly Gin Val Leu Val Gly 
35 40 45 



He Tyr Gly Gin Tyr Gin Leu Leu Gly He Lys Ser He Gly Phe Glu 
50 55 60 



Trp Asn Tyr Pro Leu Glu Glu Pro Thr Thr Glu Pro Pro Val Asn Leu 
65 70 75 80 
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Thr Tyr Ser Ala Asn Ser Pro Val Gly Arg 
85 90 



<210> 213 

<211> 193 

<212> PRT 

<213> Homo sapien 

<400> 213 

Met Asp Glu Arg Pro Pro Gly Gin Val Thr Gly Glu Ser Pro Gly Met 
15 10 15 



His Arg Pro Glu Ala Met Leu Leu Leu Leu Thr Leu Ala Leu Leu Gly 
20 25 30 



Gly Pro Thr Trp Ala Gly Lys Met Tyr Gly Pro Gly Gly Gly Lys Tyr 
35 40 45 



Phe Ser Thr Thr Glu Asp Tyr Asp His Glu He Thr Gly Leu Arg Val 
50 55 60 



Ser Val Gly Leu Leu Leu Val Lys Ser Val Gin Val Lys Leu Gly Asp 
65 70 75 80 



Ser Trp Asp Val Lys Leu Gly Ala Leu Gly Gly Asn Thr Gin Glu Val 
85 90 95 



Thr Leu Gin Pro Gly Glu Tyr He Thr Lys Val Phe Val Ala Phe Gin 
100 105 110 



Ala Phe Leu Arg Gly Met Val Met Tyr Thr Ser Lys Asp Arg Tyr Phe 
115 120 125 



Tyr Phe Gly Lys Leu Asp Gly Gin He Ser Ser Ala Tyr Pro Ser Gin 
130 135 140 



Glu Gly Gin Val Leu Val Gly He Tyr Gly Gin Tyr Gin Leu Leu Gly 
145 150 155 160 



He Lys Ser He Gly Phe Glu Trp Asn Tyr Pro Leu Glu Glu Pro Thr 
165 170 175 



Thr Glu Pro Pro Val Asn Leu Thr Tyr Ser Ala Asn Ser Pro Val Gly 
180 185 190 
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Arq 




<210> 


214 


<211> 


189 


<212> 


PRT 


<213> 


Homo sap i en 


<400> 


214 



Ala Ala Ala Arg Ala Gly Gly Glu Ser Pro Gly Met His Arg Pro Glu 
15 10 15 



Ala Met Leu Leu Leu Leu Thr Leu Ala Leu Leu Gly Gly Pro Thr Trp 
20 25 30 



Ala Gly Lys Met Tyr Gly Pro Gly Gly Gly Lys Tyr Phe Ser Thr Thr 
35 40 45 



Glu Asp Tyr Asp His Glu lie Thr Gly Leu Arg Val Ser Val Gly Leu 
50 55 60 



Leu Leu Val Lys Ser Val Gin Val Lys Leu Gly Asp Ser Trp Asp Val 
65 70 75 80 



Lys Leu Gly Ala Leu. Gly Gly Asn Thr Gin Glu Val Thr Leu Gin Pro 
85 90 95 



Gly Glu Tyr lie Thr Lys Val Phe Val Ala Phe Gin Ala Phe Leu Arg 
100 105 110 



Gly Met Val Met Tyr Thr Ser Lys Asp Arg Tyr Phe Tyr Phe Gly Lys 
115 120 125 



Leu Asp Gly Gin He Ser Ser Ala Tyr Pro Ser Gin Glu Gly Gin Val 
130 135 140 



Leu Val Gly He Tyr Gly Gin Tyr Gin Leu Leu Gly He Lys Ser He 
145 150 155 160 



Gly Phe Glu Trp Asn Tyr Pro Leu Glu Glu Pro Thr Thr Glu Pro Pro 
165 170 175 



Val Asn Leu Thr Tyr Ser Ala Asn Ser Pro Val Gly Arg 
180 185 



<210> 215 
<211> 202 
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<212> PRT 

<213> Homo sapien 

<400> 215 

Met Asp Arg Pro Pro Gly Arg Trp Arg Val Pro Gly Thr Thr Arg Arg 
15 10 15 



Pro Val Thr Gly Glu Ser Pro Gly Met His Arg Pro Glu Ala Met Leu 
20 25 30 



Leu Leu Leu Thr Leu Ala Leu Leu Gly Gly Pro Thr Trp Ala Gly Lys 
35 40 45 



Met Tyr Gly Pro Gly Gly Gly Lys Tyr Phe Ser Thr Thr Glu Asp Tyr 
50 55 60 



Asp His Glu He Thr Gly Leu Arg Val Ser Val Gly Leu Leu Leu Val 
65 70 75 80 



Lys Ser Val Gin Val Lys Leu Gly Asp Ser Trp Asp Val Lys Leu Gly 
85 90 95 



Ala Leu Gly Gly Asn Thr Gin Glu Val Thr Leu Gin Pro Gly Glu Tyr 
100 105 110 



He Thr Lys Val Phe Val Ala Phe Gin Ala Phe Leu Arg Gly Met Val 
115 120 125 



Met Tyr Thr Ser Lys Asp Arg Tyr Phe Tyr Phe Gly Lys Leu Asp Gly 
130 135 140 



Gin He Ser Ser Ala Tyr Pro Ser Gin Glu Gly Gin Val Leu Val Gly 
145 150 155 160 



He Tyr Gly Gin Tyr Gin Leu Leu Gly He Lys Ser He Gly Phe Glu 
165 170 175 



Trp Asn Tyr Pro Leu Glu Glu Pro Thr Thr Glu Pro Pro Val Asn Leu 
180 185 190 



Thr Tyr Ser Ala Asn Ser Pro Val Gly Arg 
195 200 



<210> 216 

<211> 208 

<212> PRT 

<213> Homo sapien 
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<400> 216 



Cys Arg Ala Ala Gin Cys Asp Gly Ser Ala Ala Gly Gin Val Glu Gly 
15 10 15 



Ala Arg His Asn Gin Thr Pro Ser His Gly Glu Ser Pro Gly Met His 
20 25 30 



Arg Pro Glu Ala Met Leu Leu Leu Leu Thr Leu Ala Leu Leu Gly Gly 
35 40 45 



Pro Thr Trp Ala Gly Lys Met Tyr Gly Pro Gly Gly Gly Lys Tyr Phe 
50 55 60 



Ser Thr Thr Glu Asp Tyr Asp His Glu He Thr Gly Leu Arg Val Ser 
65 70 75 80 



Val Gly Leu Leu Leu Val Lys Ser Val Gin Val Lys Leu Gly Asp Ser 
85 90 95 



Trp Asp Val Lys Leu Gly Ala Leu Gly Gly Asn Thr Gin Glu Val Thr 
100 105 110 



Leu Gin Pro Gly Glu Tyr He Thr Lys Val Phe Val Ala Phe Gin Ala 
115 120 125 



Phe Leu Arg Gly Met Val Met Tyr Thr Ser Lys Asp Arg Tyr Phe Tyr 
130 135 140 



Phe Gly Lys Leu Asp Gly Gin He Ser Ser Ala Tyr Pro Ser Gin Glu 
145 150 155 160 



Gly Gin Val Leu Val Gly He Tyr Gly Gin Tyr Gin Leu Leu Gly He 
165 170 175 



Lys Ser He Gly. Phe Glu Trp Asn Tyr Pro Leu Glu Glu Pro Thr Thr 
180 185 190 



Glu Pro Pro Val Asn Leu Thr Tyr Ser Ala Asn Ser Pro Val Gly Arg 
195 200 205 



<210> 217 

<211> 189 

<212> PRT 

<213> Homo sapien 



<400> 217 
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Met His Val Glu Arg Arg Ser Val Met Asp Arg Gly Arg Gly Glu Val 
15 10 15 



Ala Met Leu Leu Leu Leu Thr Leu Ala Leu Leu Gly Gly Pro Thr Trp 
20 25 30 



Ala Gly Lys Met Tyr Gly Pro Gly Gly Gly Lys Tyr Phe Ser Thr Thr 
35 40 45 



Glu Asp Tyr Asp His Glu lie Thr Gly Leu Arg Val Ser Val Gly Leu 
50 55 60 



Leu Leu Val Lys Ser Val Gin Val Lys Leu Gly Asp Ser Trp Asp Val 
65 70 75 80 



Lys Leu Gly Ala Leu Gly Gly Asn Thr Gin Glu Val Thr Leu Gin Pro 
85 90 95 



Gly Glu Tyr He Thr Lys Val Phe Val Ala Phe Gin Ala Phe Leu Arg 
100 105 110 



Gly Met Val Met Tyr Thr Ser Lys Asp Arg Tyr Phe Tyr Phe Gly Lys 
115 120 125 



Leu Asp Gly Gin He Ser Ser Ala Tyr Pro Ser Gin Glu Gly Gin Val 
130 135 140 



Leu Val Gly He Tyr Gly Gin Tyr Gin Leu Leu Gly He Lys Ser He 
145 150 155 160 



Gly Phe Glu Trp Asn Tyr Pro Leu Glu Glu Pro Thr Thr Glu Pro Pro 
165 170 175 



Val Asn Leu Thr Tyr Ser Ala Asn Ser Pro Val Gly Arg 
180 185 



<210> 218 

<211> 171 

<212> PRT 

<213> Homo sapien 

<400> 218 

Met Leu Glu Arg Arg He Val Asn Gly Ser Pro Gly Gin Val Gin Ser 
15 10 15 



Gin Met Tyr Gly Pro Gly Gly Gly Lys Tyr Phe Ser Thr Thr Glu Asp 
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20 25 30 



Tyr Asp His Glu He Thr Gly Leu Arg Val Ser Val Gly Leu Leu Leu 
35 40 45 



Val Lys Ser Val Gin Val Lys Leu Gly Asp Ser Trp Asp Val Lys Leu 
50 55 60 



Gly Ala Leu Gly Gly Asn Thr Gin Glu Val Thr Leu Gin Pro Gly Glu 
65 70 75 80 



Tyr He Thr Lys Val Phe Val Ala Phe Gin Ala Phe Leu Arg Gly Met 
85 90 95 



Val Met Tyr Thr Ser Lys Asp Arg Tyr Phe Tyr Phe Gly Lys Leu Asp 
100 105 110 



Gly Gin He Ser Ser Ala Tyr Pro Ser Gin Glu Gly Gin Val Leu Val 
115 120 125 



Gly He Tyr Gly Gin Tyr Gin Leu Leu Gly He Lys Ser He Gly Phe 
130 135 140 



Glu Trp Asn Tyr Pro Leu Glu Glu Pro Thr Thr Glu Pro Pro Val Asn 
145 150 155 160 



Leu Thr Tyr Ser Ala Asn Ser Pro Val Gly Arg 
165 170 



<210> 


219 


<211> 


171 


<212> 


PRT 


<213> 


Homo sapien 


<220> 




<221> 


MISC FEATURE 


<222> 


(6) .7(6) 


<223> 


Xsany amino acid 


<400> 


219 



His Ala Arg Ala Ala Xaa Cys Asp Gly Ser Pro Gly Gin Val Gin Ser 
15 10 15 



Gin Ket Tyr Gly Pro Gly Gly Gly Lys Tyr Phe Ser Thr Thr Glu Asp 
20 25 30 



Tyr Asp His Glu He Thr Gly Leu Arg Val Ser Val Gly Leu Leu Leu 
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35 40 45 



Val Lys Ser Val Gin Val Lys Leu Gly Asp Ser Trp Asp Val Lys Leu 
50 55 60 



Gly Ala Leu Gly Gly Asn Thr Gin Glu Val Thr Leu Gin Pro Gly Glu 
65 70 75 80 



Tyr lie Thr Lys Val Phe Val Ala Phe Gin Ala Phe Leu Arg Gly Met 
85 90 95 



Val Met Tyr Thr Ser Lys Asp Arg Tyr Phe Tyr Phe Gly Lys Leu Asp 
100 105 110 



Gly Gin lie Ser Ser Ala Tyr Pro Ser Gin Glu Gly Gin Val Leu Val 
115 120 125 



Gly lie Tyr Gly Gin Tyr Gin Leu Leu Gly He Lys Ser lie Gly Phe 
130 135 140 



Glu Trp Asn Tyr Pro Leu Glu Glu Pro Thr Thr Glu Pro Pro Val Asn 
145 150 155 160 



Leu Thr Tyr Ser Ala Asn Ser Pro Val Gly Arg 
165 170 



<210> 220 

<211> 156 

<212> PRT 

<213> Homo sapien 

<400> 220 

Met Val Leu Asp Ser Leu His Pro Gly Lys Glu Asp Gly Gly Ala Glu 
15 10 15 



Asp Pro Gly Cys Ala Gly Pro Ser Gin He Trp Thr Ser Lys Ala Leu 
20 25 30 



Pro Leu Ser Ser Val Gin Val Lys Leu Gly Asp Ser Trp Asp Val Lys 
35 40 45 



Leu Gly Ala Leu Gly Gly Asn Thr Gin Glu Val Thr Leu Gin Pro Gly 
50 55 60 



Glu Tyr He Thr Lys Val Phe Val Ala Phe Gin Ala Phe Leu Arg Gly 
65 70 75 80 
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Met. Val Met Tyr Thr Ser Lys Asp Arg Tyr Phe Tyr Phe Gly Lys Leu 
85 90 95 



Asp Gly Gin lie Ser Ser Ala Tyr Pro Ser Gin Glu Gly Gin Val Leu 
100 105 110 



Val Gly He Tyr Gly Gin Tyr Gin Leu Leu Gly He Lys Ser He Gly 
115 120 125 



Phe Glu Trp Asn Tyr Pro Leu Glu Glu Pro Thr Thr Glu Pro Pro Val 
130 135 140 



Asn Leu Thr Tyr Ser Ala Asn Ser Pro Val Gly Arg 
150 155 



145 




<210> 


221 


<211> 


156 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


221 



Trp Cys Trp Thr Leu Cys He Pro Gly Arg Arg Met Gly Ala Leu Arg 
15 10 15 



Thr Arg Asp Val Leu Gly His Pro Arg Ser Gly Arg Pro Lys Leu Cys 
20 25 30 



Leu Ser Pro Ser Val Gin Val Lys Leu Gly Asp Ser Trp Asp Val Lys 
35 40 45 



Leu Gly Ala Leu Gly Gly Asn Thr Gin Glu Val Thr Leu Gin Pro Gly 
50 55 60 



Glu Tyr He Thr Lys Val Phe Val Ala Phe Gin Ala Phe Leu Arg Gly 
65 70 75 80 



Met Val Met Tyr Thr Ser Lys Asp Arg Tyr Phe Tyr Phe Gly Lys Leu 
85 90 95 



Asp Gly Gin He Ser Ser Ala Tyr Pro Ser Gin Glu Gly Gin Val Leu 
100 105 110 



Val Gly He Tyr Gly Gin Tyr Gin Leu Leu Gly He Lys Ser He Gly 
115 120 125 



Phe Glu Trp Asn Tyr Pro Leu Glu Glu Pro Thr Thr Glu Pro Pro Val 
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130 135 140 



Asn Leu Thr Tyr Ser Ala Asn Ser Pro Val Gly Arg 
145 150 155 



<210> 222 

<211> 76 

<212> PRT 

<213> Homo sapien 

<400> 222 

Met Val Met Tyr Thr Ser Lys Asp Arg Tyr Phe Tyr Phe Gly Lys Leu 
15 10 15 



Asp Gly Gin lie Ser Ser Ala Tyr Pro Ser Gin Glu Gly Gin Val Leu 
20 25 30 



Val Gly lie Tyr Gly Gin Tyr Gin Leu Leu Gly lie Lys Ser He Gly 
35 40 45 



Phe Glu Trp Asn Tyr Pro Leu Glu Glu Pro Thr Thr Glu Pro Pro Val 
50 55 60 



Asn Leu Thr Tyr Ser Ala Asn Ser Pro Val Gly Arg 
65 70 75 



<210> 223 

<211> 139 

<212> PRT 

<213> Homo sapien 

<400> 223 

Leu Cys Arg Gly Gin Lys Glu Ser Ser Thr Thr Pro Ser Glu Val Leu 
15 10 15 



Trp He Ser Val Pro Val Pro Gin Ser Leu Lys Ser Gin Ala Ser Arg 
20 25 30 



Pro Pro Leu Pro Thr Val Pro His Pro Arg Pro Thr Gin Arg Ala Ser 
35 40 45 



Ala Gly His Ser Val Pro Gly Phe Ser Glu Cys Ser Arg Gly Leu Arg 
50 55 60 



Glu Ala Thr His Ser Ser He His Ser Ala Asn He Cys Gin Gly Arg 
65 70 75 80 
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Val Leu Thr Arg Leu Ala Trp His Trp Gly Tyr Lys Glu Glu Ala Arg 
85 90 95 



Phe Gin Leu Ser Ala Tyr Thr Leu Trp Trp Gly Leu Val Gin Arg Gin 
100 105 110 



lie Val Ala Val His Phe Ala He Cys Met Asp Gly Asp Thr Cys Arg 
115 120 125 



Ser Leu Cys Val Gly Thr Cys Pro Glu Val Arg 
130 135 



<210> 224 

<211> 568 

<212> PRT 

<213> Homo sapien 

<400> 224 

Met Val Lys Leu Ala Lys Ala Gly Lys Asn Gin Gly Asp Pro Lys Lys 
15 10 15 



Met Ala Pro Pro Pro Lys Glu Val Glu Glu Asp Ser Glu Asp Glu Glu 
20 25 30 



Met Ser Glu Asp Glu Glu Asp Asp Ser Ser Gly Glu Glu Val Val He 
35 40 45 



Pro Gin Lys Lys Gly Lys Lys Ala Ala Ala Thr Ser Ala Lys Lys Val 
50 55 60 



Val Val Ser Pro Thr Lys Lys Val Ala Val Ala Thr Pro Ala Lys Lys 
65 70 75 80 



Ala Ala Val Thr Pro Gly Lys Lys Ala Ala Ala Thr Pro Ala Lys Lys 
85 90 95 



Thr Val Thr Pro Ala Lys Ala Val Thr Thr Pro Gly Lys Lys Gly Ala 
100 105 110 



Thr Pro Gly Lys Ala Leu Val Ala Thr Pro Gly Lys Lys Gly Ala Ala 
115 120 125 



He Pro Ala Lys Gly Ala Lys Asn Gly Lys Asn Ala Lys Lys Glu Asp 
130 135 140 



Ser Asp Glu Glu Glu Asp Asp Asp Ser Glu Glu Asp Glu Glu Asp Asp 
145 150 155 160 
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Glu Asp Glu Asp Glu Asp Glu Asp Glu He Glu Pro Ala Ala Met Lys 
165 170 175 



Ala Ala Ala Ala Ala Pro Ala Ser Glu Asp Glu Asp Asp Glu Asp Asp 
180 185 190 



Glu Asp Asp Glu Asp Asp Asp Asp Asp Glu Glu Asp Asp Ser Glu Glu 
195 200 205 



Glu Ala Met Glu Thr Thr Pro Ala Lys Gly Lys Lys Ala Ala Lys Val 
210 215 220 



Val Pro Val Lys Ala Lys Asn Val Ala Glu Asp Glu Asp Glu Glu Glu 
225 230 235 240 



Asp Asp Glu Asp Glu Asp Asp Asp Asp Asp Glu Asp Asp Glu Asp Asp 
245 ' 250 255 



Asp Asp Glu Asp Asp Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Pro 
260 265 270 



Val Lys Glu Ala Pro Gly Lys Arg Lys Lys Glu Met Ala Lys Gin Lys 
275 280 285 



Ala Ala Pro Glu Ala Lys Lys Gin Lys Val Glu Gly Thr Glu Pro Thr 
290 295 300 



Thr Ala Phe Asn Leu Phe Val Gly Asn Leu Asn Phe Asn Lys Ser Ala 
305 310 315 320 



Pro Glu Leu Lys Thr Gly He Ser Asp Val Phe Ala Lys Asn Asp Leu 
325 330 335 



Ala Val Val Asp Val Arg He Gly Met Thr Arg Lys Phe Gly Tyr Val 
340 345 350 



Asp Phe Glu Ser Ala Glu Asp Leu Glu Lys Ala Leu Glu Leu Thr Gly 
355 360 365 



Leu Lys Val Phe Gly Asn Glu He Lys Leu Glu Lys Pro Lys Gly Lys 
370 375 380 



Asp Ser Lys Lys Glu Arg Asp Ala Arg Thr Leu Leu Ala Lys Asn Leu 
385 390 395 400 
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Pro Tyr Lys Vai Thr Gin Asp Glu Leu Lys Glu Val Phe Glu Asp Ala 
405 410 415 



Ala Glu He Arg Leu Val Ser Lys Asp Gly Lys Ser Lys Gly He Ala 
420 425 430 



Tyr He Glu Phe Lys Thr Glu Ala Asp Ala Glu Lys Thr Phe Glu Glu 
435 440 445 



Lys Gin Gly Thr Glu He Asp Gly Arg Ser He Ser Leu Tyr Tyr Thr 
450 455 460 



Gly Glu Lys Gly Gin Asn Gin Asp Tyr Arg Gly Gly Lys Asn Ser Thr 
465 470 475 480 



Trp Ser Gly Glu Ser Lys Thr Leu Val Leu Ser Asn Leu Ser Tyr Ser 
485 490 495 



Ala Thr Glu Glu Thr Leu Gin Glu Val Phe Glu Lys Ala Thr Phe He 
500 505 510 



Lys Val Pro Gin Asn Gin Asn Gly Lys Ser Lys Gly Tyr Ala Phe He 
515 520 525 



Glu Phe Ala Ser Phe Glu Asp Ala Lys Glu Ala Leu Asn Ser Cys Asn 
530 535 540 



Lys Arg Glu He Glu Gly Arg Ala He Arg Leu Glu Ala Arg Arg Leu 
545 550 555 560 



Pro Arg Arg Gin Arg Arg Arg Arg 
565 



<210> 225 

<211> 520 

<212> PRT 

<213> Homo sapien 

<400> 225 

Met Val Lys Leu Ala Lys Ala Gly Lys Asn Gin Gly Asp Pro l#ys Lys 
15 10 15 



Met Ala Pro Pro Pro Lys Glu Val Glu Glu Asp Ser Glu Asp Glu Glu 
20 25 30 



Met Ser Glu Asp Glu Glu Asp Asp Ser Ser Gly Glu Glu Val Val He 
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35 



40 



45 



Pro Gin Lys Lys Gly Lys Lys Ala Ala Ala Thr Ser Ala Lys Lys Val 
50 55 60 



Val Val Ser Pro Thr Lys Lys Val Ala Val Ala Thr Pro Ala Lys Lys 
65 70 75 80 



Ala Ala Val Thr Pro Gly Lys Lys Ala Ala Ala Thr Pro Ala Lys Lys 
85 90 95 



Thr Val Thr Pro Ala Lys Ala Val Thr Thr Pro Gly Lys Lys Gly Ala 
100 105 110 



Thr Pro Gly Lys Ala Leu Val Ala Thr Pro Gly Lys Lys Gly Ala Ala 
115 120 125 



He Pro Ala Lys Gly Ala Lys Asn Gly Lys Asn Ala Lys Lys Glu Asp 
130 135 140 



Ser Asp Glu Glu Glu Asp Asp Asp Ser Glu Glu Asp Glu Glu Asp Asp 
145 150 155 160 



Glu Asp Glu Asp Glu Asp Glu Asp Glu He Glu Pro Ala Ala Met Lys 
165 170 175 



Ala Ala Ala Ala Ala Pro Ala Ser Glu Asp Glu Asp Asp Glu Asp Asp 
180 185 190 



Glu Asp Asp Glu Asp Asp Asp Asp Asp Glu Glu Asp Asp Ser Glu Glu 
195 200 205 



Glu Ala Met Glu Thr Thr Pro Ala Lys Gly Lys Lys Ala Ala Lys Val 
210 215 220 



Val Pro Val Lys Ala Lys Asn Val Ala Glu Asp Glu Asp Glu Glu Glu 
225 230 235 240 



Asp Asp Glu Asp Glu Asp Asp Asp Asp Asp Glu Asp Asp Glu Asp Asp 
245 250 255 



Asp Asp Glu Asp Asp Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Pro 
260 265 270 



Val Lys Glu Ala Pro Gly Lys Arg Lys Lys Glu Met Ala Lys Gin Lys 
275 280 285 
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Ala Ala Pro Glu Ala Lys Lys Gin Lys Val Glu Gly Thr Glu Pro Thr 
290 295 300 



Thr Ala Phe Asn Leu Phe Val Gly Asn Leu Asn Phe Asn Lys Ser Ala 
305 310 315 320 



Pro Glu Leu Lys Thr Gly lie Ser Asp Val Phe Ala Lys Asn Asp Leu 
325 330 335 



Ala Val Val Asp Val Arg He Gly Met Thr Arg Lys Phe Gly Tyr Val 
340 345 350 



Asp Phe Glu Ser Ala Glu Asp Leu Glu Lys Ala Leu Glu Leu Thr Gly 
355 360 365 



Leu Lys Val Phe Gly Asn Glu He Lys Leu Glu Lys Pro Lys Gly Lys 
370 375 380 



Asp Ser Lys Lys Glu Arg Asp Ala Arg Thr Leu Leu Ala Lys Asn Leu 
385 390 395 400 



Pro Tyr Lys Val Thr Gin Asp Glu Leu Lys Glu Val Phe Glu Asp Ala 
405 410 415 



Ala Glu He Arg Leu Val Ser Lys Asp Gly Lys Ser Lys Gly He Ala 
420 425 430 



Tyr He Glu Phe Lys Thr Glu Ala Asp Ala Glu Lys Thr Phe Glu Glu 
435 440 445 



Lys Gin Gly Thr Glu He Asp Gly Arg Ser He Ser Leu Tyr Tyr Thr 
450 455 460 



Gly Glu Lys Gly Gin Asn Gin Asp Tyr Arg Gly Gly Lys Asn Ser Thr 
465 470 475 480 



Trp Ser Gly Glu Ser Lys Thr Leu Val Leu Ser Asn Leu Ser Tyr Ser 
485 490 495 



Ala Thr Glu Glu Thr Leu Gin Glu Val Phe Glu Lys Ala Thr Phe He 
500 505 510 



Lys Val Pro Arg Pro Arg Pro Arg 
515 520 
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<210> 226 

<211> 526 

<212> PRT 

<213> Homo sapien 

<400> 226 

Met Leu Arg Leu Pro Thr Val Phe Arg Gin Met Arg Pro Val Ser Arg 
15 10 15 



Val Leu Ala Pro His Leu Thr Arg Ala Tyr Ala Lys Asp Val Lys Phe 
20 25 30 



Gly Ala Asp Ala Arg Ala Leu Met Leu Gin Gly Val Asp Leu Leu Ala 
35 40 45 



Asp Ala Val Ala Val Thr Met Gly Pro Lys Gly Arg Thr Val lie lie 
50 55 60 



Glu Gin Ser Trp Gly Ser Pro Lys Val Thr Lys Asp Gly Val Thr Val 
65 70 75 80 



Ala Lys Ser lie Asp Leu Lys Asp Lys Tyr Lys Asn He Gly Ala Lys 
85 90 95 



Leu Val Gin Asp Val Ala Asn Asn Thr Asn Glu Glu Ala Gly Asp Gly 
100 105 110 



Thr Thr Thr Ala Thr Val Leu Ala Arg Ser He Ala Lys Glu Gly Phe 
115 120 125 



Glu Lys He Ser Lys Gly Ala Asn Pro Val Glu He Arg Arg Gly Val 
130 135 140 



Met Leu Ala Val Asp Ala Val He Ala Glu Leu Lys Lys Gin Ser Lys 
145 150 155 160 



Pro Val Thr Thr Pro Glu Glu He Ala Gin Val Ala Thr He Ser Ala 
165 170 175 



Asn Gly Asp Lys Glu He Gly Asn He He Ser Asp Ala Met Lys Lys 
180 IftS 190 



Val Gly Arg Lys Gly Val He Thr Val Lys Asp Gly Lys Thr Leu Asn 
195 200 205 



Asp Glu Leu Glu He He Glu Gly Met Lys Phe Asp Arg Gly Tyr He 
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296 

215 220 



Ser Pro Tyr Phe lie Asn Thr Ser Lys Gly Gin Lys Cys Glu Phe Gin 
225 230 235 240 



Asp Ala Tyr Val Leu Leu Ser Glu Lys Lys He Ser Ser He Gin Ser 
245 250 255 



He Val Pro Ala Leu Glu He Ala Asn Ala His Arg Lys Pro Leu Val 
260 265 270 



He He Ala Glu Asp Val Asp Gly Glu Ala Leu Ser Thr Leu Val Leu 
275 280 285 



Asn Arg Leu Lys Val Gly Leu Gin Val Val Ala Val Lys Ala Pro Gly 
290 295 300 



Phe Gly Asp Asn Arg Lys Asn Gin Leu Lys Asp Met Ala He Ala Thr 
305 310 315 320 



Gly Gly Ala Val Phe Gly Glu Glu Gly Leu Thr Leu Asn Leu Glu Asp 
325 330 335 



Val Gin Pro His Asp Leu Gly Lys Val Gly Glu Val He Val Thr Lys 
340 345 350 



Asp Asp Ala Met Leu Leu Lys Gly Lys Gly Asp Lys Ala Gin He Glu 
355 360 365 



Lys Arg He Gin Glu He He Glu Gin Leu Asp Val Thr Thr Ser Glu 
370 375 380 



Tyr Glu Lys Glu Lys Leu Asn Glu Arg Leu Ala Lys Leu Ser Asp Gly 
390 395 400 



Val Ala Val Leu Lys Val Gly Gly Thr Ser Asp Val Glu Val Asn Glu 
405 410 415 



Lys Lys Asp Arg Val Thr Asp Ala Leu Asn Ala Thr Arg Ala Ala Val 
420 425 430 



Glu Glu Gly He Val Leu Gly Gly Gly Cys Ala Leu Leu Arg Cys He 
435 440 445 



Pro Ala Leu Asp Ser Leu Thr Pro Ala Asn Glu Asp Gin Lys He Gly 
450 455 
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He Glu He He Lys Arg Thr Leu Lys He Pro Ala Met Thr He Ala 
465 470 475 480 



Lys Asn Ala Gly Val Glu Gly Ser Leu He Val Glu Lys He Met Gin 
485 490 495 



Ser Ser Ser Glu Val Gly Tyr Asp Ala Met Ala Gly Asp Phe Val Asn 
500 505 510 



520 525 





515 


<210> 


227 


<211> 


121 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


227 



Gin Cys Asp Gly Phe Ala Ala Glu Val Ser Thr Val His Glu He Leu 
15 10 15 



Cys Lys Leu Ser Leu Glu Gly Asp His Ser Thr Pro Pro Ser Ala Tyr 
20 25 30 



Gly Ser Val Lys Ala Tyr Thr Asn Phe Asp Ala Glu Arg Asp Ala Leu 
35 40 45 



Asn He Glu Thr Ala He Lys Thr Lys Glu Ala Val Asp Glu Val Thr 
50 55 60 



He Val Asn He Leu Thr Asn Arg Ser Asn Ala Gin Arg Gin Asp He 
65 70 75 80 



Ala Phe Ala Tyr Gin Arg Arg Thr Lys Lys Glu Leu Ala Ser Ala Leu 
85 90 95 



Lys Ser Ala Leu Ser Gly His Leu Glu Thr Val He Leu Gly Leu Leu 
100 105 110 



Lys Thr Pro Ala Gin Tyr Asp Ala Ser 
115 120 



<210> 228 

<211> 71 

<212> PRT 

<213> Homo sapien 
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<400> 228 



Asn Ser His Gin Asp Gin Arg Gly Val Asp Glu Val Thr He Val Asn 
15 10 15 



He Leu Thr Asn Arg Ser Asn Ala Gin Arg Gin Asp He Ala Phe Ala 
20 25 30 



Tyr Gin Arg Arg Thr Lys Lys Glu Leu Ala Ser Ala Leu Lys Ser Ala 
35 40 45 



Leu Ser Gly His Leu Glu Thr Val He Leu Gly Leu Leu Lys Thr Pro 
50 55 60 



Ala Gin Tyr Asp Ala Ser Glu 
65 70 



<210> 229 

<211> 242 

<212> PRT 

<213> Homo sapien 

<400> 229 

Met Leu Glu Arg Arg Ser Val Met Asp Val Val Ala Ala Glu Gly Arg 
15 10 15 



Ser Gin Leu Ser Ala His Gly Pro Ala Ser Phe Lys Met Ser Thr Val 
20 25 30 



His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Asp His Ser Thr Pro 
35 40 45 



Pro Ser Ala Tyr Gly Ser Val Lys Ala Tyr Thr Asn Phe Asp Ala Glu 
50 55 60 



Arg Asp Ala Leu Asn He Glu Thr Ala He Lys Thr Lys Gly Val Asp 
65 70 75 80 



Glu Val Thr He Val Asn He Leu Thr Asn Arg Ser Asn Ala Gin Arg 
85 90 95 



Gin Asp He Ala Phe Ala Tyr Gin Arg Arg Thr Lys Lys Glu Leu Ala 
100 105 110 



Ser Ala Leu Lys Ser Ala Leu Ser Gly His Leu Glu Thr Val He Leu 
115 120 125 
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Gly Leu Leu Lys Thr Pro Ala Gin Tyr Asp Ala Ser Glu Leu Cys Ser 
130 135 140 



Arg Thr Asn Gin Glu Leu Gin Glu lie Asn Arg Val Tyr Lys Glu Met 
145 150 155 160 



Tyr Lys Thr Asp Leu Glu Lys Asp lie He Ser Asp Thr Ser Gly Asp 
165 170 175 



Phe Arg Lys Leu Met Val Ala Leu Ala Lys Gly Arg Arg Ala Glu Asp 
180 185 190 



Gly Ser Val lie Asp Tyr Glu Leu He Asp Gin Asp Ala Arg Asp Leu 
195 200 205 



Tyr Asp Ala Gly Val Lys Arg Val Lys Arg Lys Gly Thr Asp Val Pro 
210 215 220 



Lys Trp He Ser He Met Thr Glu Arg Ser Val Ala Pro Pro Pro Glu 
225 230 235 240 



Ser He 



<210> 230 

<211> 342 

<212> PRT 

<213> Homo sapien 

<400> 230 

Trp He Val Val Ala Ala Glu Gly Arg Ser Gin Leu Ser Ala His Gly 
15 10 15 



Pro Ala Ser Phe Lys Met Ser Thr Val His Glu He Leu Cys Lys Leu 
20 25 30 



Ser Leu Glu Gly Asp His Ser Thr Pro Pro Ser Ala Tyr Gly Ser Val 
35 40 45 



Lys Ala Tyr Thr Asn Phe Asp Ala Glu Arg Asp Ala Leu Asn He Glu 
50 55 60 



Thr Ala He Lys Thr Lys Gly Val Asp Glu Val Thr He Val Asn He 
65 70 75 80 



Leu Thr Asn Arg Ser Asn Ala Gin Arg Gin Asp He Ala Phe Ala Tyr 
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300 
90 



95 



Gin Arg Arg Thr Lys Lys Glu Leu Ala Ser Ala Leu Lys Sex Ala Leu 
100 105 110 



Ser Gly His Leu Glu Thr Val He Leu Gly Leu Leu Lys Thr Pro Ala 
115 120 125 



Gin Tyr Asp Ala Ser Glu Leu Cys Ser Arg Thr Asn Gin Glu Leu Gin 
130 135 140 



Glu He Asn Arg Val Tyr Lys Glu Met Tyr Lys Thr Asp Leu Glu Lys 
145 ISO 155 160 



Asp He He Ser Asp Thr Ser Gly Asp Phe Arg Lys Leu Met Val Ala 
165 170 175 



Leu Ala Lys Gly Arg Arg Ala Glu Asp Gly Ser Val He Asp Tyr Glu 
180 185 190 



Leu He Asp Gin Asp Ala Arg Asp Leu Tyr Asp Ala Gly Val Lys Arg 
195 200 205 



Lys Gly Thr Asp Val Pro Lys Trp He Ser He Met Thr Glu Arg Ser 
210 215 220 



Val Pro His Leu Gin Lys Val Phe Asp Arg Tyr Lys Ser Tyr Ser Pro 
225 230 235 240 



Tyr Asp Met Leu Glu Ser He Arg Lys Glu Val Lys Gly Asp Leu Glu 
245 250 255 



Asn Ala Phe Leu Asn Leu Val Gin Cys He Gin Asn Lys Pro Leu Tyr 
260 265 270 



Phe Ala Asp Arg Leu Tyr Asp Ser Met Lys Gly Lys Gly Thr Arg Asp 
275 280 285 



Lys Val Leu He Arg He Met Val Ser Arg Ser Glu Val Asp Met Leu 
290 295 300 



Lys He Arg Ser Glu Phe Lys Arg Lys Tyr Gly Lys Ser Leu Tyr Tyr 
305 310 315 320 



Tyr He Gin Gin Asp Thr Lys Gly Asp Tyr Gin Lys Ala Leu Leu Tyr 
325 330 335 
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Leu Cys Gly Gly Asp Asp 
340 



<210> 231 

<211> 72 

<212> PRT 

<213> Homo sapien 

<400> 231 



Pro Arg Pro Leu Leu Ala Arg Arg Tyr Leu Cys Arg Val Thr Ser Cys 
15 10 15 



Phe Leu Ser Leu Ser Arg Ala Val Trp Trp Gin Gin Ala Gin Pro Gin 
20 25 30 



Ala Gin Ala Gin Pro Arg Asn Ala Glu Arg Arg Arg Arg Val Arg Gly 
35 40 45 



Pro Val Arg Ala Ala Glu Met Arg Pro Leu Ala He Ala Ser Ser Val 

60 



50 


55 


Pro Arg Thr Thr His Pro Ser Arg 


65 


70 


<210> 


232 


<211> 


103 


<212> 


PRT 


<213> 


Homo sapien 


<400> 


232 


Leu Leu Pro Phe Ser Leu Ala Arg Gly Val 


1 


5 10 



15 



Gly Ala Pro Ser Leu Glu Met Gin Asn Asp Ala Gly Glu Phe Val Asp 
20 25 30 



Leu Tyr Val Pro Arg Lys Cys Ser Ala Ser Asn Arg He He Gly Ala 
35 40 45 



Lys Asp His Ala Ser He Gin Met Asn Val Ala Glu Val Asp Lys Val 
50 55 60 



Thr Gly Arg Phe Asn Gly Gin Phe Lys Thr Tyr Ala He Cys Gly Ala 
65 70 75 80 



He Arg Arg Met Gly Glu Ser Asp Asp Ser He Leu Arg Leu Ala Lys 
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85 90 95 



Ala Asp Gly lie Val Ser Lys 
100 



<210> 


233 




<211> 


112 




<212> 


PRT 




<213> 


Homo 


sapien 


<400> 


233 




Leu Leu Pro 


Phe Ser 


1 




5 



10 15 



Gly Ala Pro Ser Leu Glu Met Gin Asn Asp Ala Gly Glu Phe Val Asp 
20 25 30 



Leu Tyr Val Pro Arg Lys Cys Ser Ala Ser Asn Arg He He Gly Ala 
35 40 45 



Lys Asp His Ala Ser He Gin Met Asn Val Ala Glu Val Asp Lys Val 
50 55 60 



Thr Gly Arg Phe Asn Gly Gin Phe Lys Thr Tyr Ala He Cys Gly Ala 
65 70 75 80 



He Arg Arg Met Val Ser Val Ser Leu Gly Phe Ala His His Phe Gly 
85 90 95 



Thr Ser Trp Thr Leu Pro Cys Ala Leu Glu Cys Val Met Val Pro Glu 
100 105 110 



<210> 234 

<211> 87 

<212> PRT 

<213> Homo sapien 

<400> 234 

Ala Arg Gly He Ala Arg Gly Val Val Ala Ala Gly Ala Ala Gly Ala 
1 5 10 15 



Gly Pro Ala Ser Lys Cys Arg Thr Thr Pro Ala Ser Ser Trp Thr Cys 
20 25 30 



Thr Cys Arg Gly Asn Ala Ser Ala Ser Asn Arg He He Gly Ala Lys 
35 40 45 
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Asp His Ala Ser He Gin Met Asn Val Ala Glu Val Ser Trp Glu Pro 
50 55 60 



Gly Arg Arg Glu Gly Cys Asp He Cys Ala Gly Lys Ala Gly Cys Pro 
65 70 75 80 



He Val Glu Glu Pro Leu Gly 
85 



<210> 235 

<211> 86 

<212> PRT 

<213> Homo sapien 

<400> 235 

Ala Arg Gly He Ala Arg Gly Val Val Ala Ala Gly Ala Ala Gly Ala 
15 10 15 



Pro Ser Leu Glu Met Gin Asn Asp Ala Gly Glu Phe Val Asp Leu Tyr 
20 25 30 



Val Pro Arg Lys Cys Ser Ala Ser Asn Arg He He Gly Ala Lys Asp 
35 40 45 



His Ala Ser He Gin Met Asn Val Ala Glu Val Ser Trp Glu Pro Gly 
50 55 60 



Arg Arg Glu Gly Cys Asp He Cys Ala Gly Lys Ala Gly Cys Pro He 
65 70 75 80 



Val Glu Glu Pro Leu Gly 
85 



<210> 236 

<211> 77 

<212> PRT 

<213> Homo sapien 

<400> 236 

Met Arg Gly Arg Gly Arg Gly Thr Cys Arg Gly Asn Ala Ser Ala Ser 
1 5 10 15 



Asn Arg He He Gly Ala Lys Asp His Ala Ser He Gin Met Asn Val 
20 25 30 



Ala Glu Val Asp Lys Val Thr Gly Arg Phe Asn Gly Gin Phe Lys Thr 
35 40 45 
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Tyr Ala lie Cys Gly Ala He Arg Arg Met Gly Glu Ser Asp Asp Ser 
50 55 60 



He Leu Arg Leu Ala Lys Ala Asp Gly He Val Ser Lys 
65 70 75 



<210> 237 

<211> 86 

<212> PRT 

<213> Homo sapien 

<400> 237 

He Met Pro Ser Gly Ala Ser Val Met Asp Ala Trp Ser Arg Pro Arg 
15 10 15 



Tyr Val Pro Arg Lys Cys Ser Ala Ser Asn Arg He He Gly Ala Lys 
20 25 30 



Asp His Ala Ser He Gin Met Asn Val Ala Glu Val Asp Lys Val Thr 
35 40 45 



Gly Arg Phe Asn Gly Gin Phe Lys Thr Tyr Ala He Cys Gly Ala He 
50 55 60 



Arg Arg Met Gly Glu Ser Asp Asp Ser He Leu Arg Leu Ala Lys Ala 
65 70 75 80 



Asp Gly He Val Ser Lys 
65 



<210> 


238 


<211> 


21 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic 


<400> 


238 


tacgcagagc tcatcgtcct t 


<210> 


239 


<211> 


22 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Synthetic 



21 



<400> 239 
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305 

22 



<210> 240 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic 



<400> 240 

tggctgagct cttacctggt tttcaggc 



28 



